REPORT  DOCUMENTATION  PAGE 


Form  Approved  OMB  No.  0704-0188 


Public  reporting  burden  for  this  collection  of  information  is  estimated  to  average  1  hour  per  response,  including  the  time  for  reviewing  instructions,  searching  existing  data  sources,  gathering  and 
maintaining  the  data  needed,  and  completing  and  reviewing  the  collection  of  information.  Send  comments  regarding  this  burden  estimate  or  any  other  aspect  of  this  collection  of  information, 
including  suggestions  for  reducing  the  burden,  to  Department  of  Defense,  Washington  Headquarters  Services,  Directorate  for  Information  Operations  and  Reports  (0704-0188),  1215  Jefferson 
Davis  Highway,  Suite  1204,  Arlington,  VA  22202-4302.  Respondents  should  be  aware  that  notwithstanding  any  other  provision  of  law,  no  person  shall  be  subject  to  any  penalty  for  failing  to  comply 
with  a  collection  of  information  if  it  does  not  display  a  currently  valid  OMB  control  number. 

PLEASE  DO  NOT  RETURN  YOUR  FORM  TO  THE  ABOVE  ADDRESS. 


1.  REPORT  DATE  (DD-MM-YYYY)  2.  REPORT  TYPE  3.  DATES  COVERED  (From  -  To) 

20-05-2009  Final  Report  1  December  2007  -  15-Dec-09 


4.  TITLE  AND  SUBTITLE 


Design  and  Control  of  Wireless  Mobile  Adhoc  and  Sensor  Networks 


5a.  CONTRACT  NUMBER 

FA8655-08-1  -3020 


5b.  GRANT  NUMBER 


5c.  PROGRAM  ELEMENT  NUMBER 


6.  AUTHOR(S) 


5d.  PROJECT  NUMBER 


Professor  Leandros  Tassiulas 


5d.  TASK  NUMBER 


5e.  WORK  UNIT  NUMBER 


7.  PERFORMING  ORGANIZATION  NAME(S)  AND  ADDRESS(ES) 

University  of  Thessaly 

37  Glavani  -  28th  October  Str,  Deligiorgi  Building,  5th  floor 

Volos  38221 

Greece 


8.  PERFORMING  ORGANIZATION 
REPORT  NUMBER 


9.  SPONSORING/MONITORING  AGENCY  NAME(S)  AND  ADDRESS(ES) 


10.  SPONSOR/MONITOR’S  ACRONYM(S) 


EOARD 

Unit  4515  BOX  14 
APO  AE  09421 


11.  SPONSOR/MONITOR’S  REPORT  NUMBER(S) 

Grant  08-3020 


12.  DISTRIBUTION/AVAILABILITY  STATEMENT 


Approved  for  public  release;  distribution  is  unlimited. 


14.  ABSTRACT 

In  this  project  we  investigated  challenges  of  wireless  networks  design  with  focus  on  energy  management  on  battery  dependent  devices  and  the 
impact  of  a  new  hidden  terminal  problem  on  the  operational  efficiency  of  wireless  networks.  The  results  summarized  in  this  report  have  been 
presented  in  a  number  of  papers  published  or  submitted  for  publication.  In  the  first  part  of  this  work  we  consider  the  optimal  control  of  wireless 
networks  operating  with  rechargeable  batteries  under  general  arrival,  channel  and  recharge  processes.  The  objective  is  to  maximize  total  system 
utility  while  satisfying  energy/power  constraints.  Starting  from  a  downlink  scenario,  we  propose  a  policy  with  decoupled  admission  control  and  power 
allocation  decisions  that  achieves  asymptotic  optimality  for  sufficiently  large  battery  capacity.  Extensions  to  single-hop  and  multihop  networks  are  also 
presented.  Such  policies  are  particularly  suitable  for  satellite  downlinks  or  sensor  networks.  In  the  second  part  of  this  work,  we  investigated 
performance  unfairness  (in  terms  of  throughput  and  delay)  on  the  Medium  Access  Control  (MAC)  mechanism  of  802.1 1  standard  in  the  presence  of 
hidden  terminals.  To  eliminate  the  impacts  from  the  physical  layer  (PHY),  we  assume  good  channel.  In  this  case,  we  may  intuitively  expect  that  all 
stations  have  equal  success  probabilities  in  their  transmissions.  This  intuition  is  from  the  random  access  scheme  of  Carrier  Sense  Multiple  Access 
with  Collision  Avoidance  (CSMA/CA)  which  is  employed  by  802.11  MAC  protocol.  However,  we  will  show  that  the  intuition  is  not  true  due  to  hidden 
terminals.  Instead,  the  following  fact  is  revealed  in  this  work:  The  success  probability  of  a  transmission  is  location-dependent.  More  specifically,  the 
nodes  far  from  the  access  point  see  more  hidden  terminals  than  those  close  to  the  access  point,  so  they  experience  more  collisions  and  thus  smaller 
success  probabilities.  We  build  a  model  to  analyze  the  throughput  and  collision  probability  for  nodes  at  different  locations,  validate  them  via 
simulations  and  compare  with  the  measurement  results  from  experiments. 


15.  SUBJECT  TERMS 

EOARD,  Distributed  sensors,  Complex  Systems,  Ad-hoc  Networks 


16.  SECURITY  CLASSIFICATION  OF: 

a.  REPORT 

UNCLAS 

b.  ABSTRACT 

UNCLAS 

c.  THIS  PAGE 

UNCLAS 

17.  LIMITATION  OF 
ABSTRACT 
UL 


18,  NUMBER 
OF  PAGES 


19a.  NAME  OF  RESPONSIBLE  PERSON 

JAMES  LAWTON  Ph.  D. 

19b.  TELEPHONE  NUMBER  (Include  area  code) 

+44  (0)1895  616187 


Standard  Form  298  (Rev.  8/98) 

Prescribed  by  ANSI  Std.  Z39-18 


Adhoc  wireless  network  control:  energy  efficiency 
and  hidden  terminal  considerations 


Abstract 

In  this  project  we  investigated  challenges  of  wireless  networks  design  with  focus  on  energy  management  on  battery  dependent 
devices  and  the  impact  of  a  new  hidden  terminal  problem  on  the  operational  efficiency  of  wireless  networks.  The  results  summarized 
in  this  report  have  been  presented  in  a  number  of  papers  published  [1],  [2]  or  submitted  for  publication  [3],  [4],  In  the  first  part 
of  this  work  we  consider  the  optimal  control  of  wireless  networks  operating  with  rechargeable  batteries  under  general  arrival, 
channel  and  recharge  processes.  The  objective  is  to  maximize  total  system  utility  while  satisfying  energy/power  constraints.  Starting 
from  a  downlink  scenario,  we  propose  a  policy  with  decoupled  admission  control  and  power  allocation  decisions  that  achieves 
asymptotic  optimality  for  sufficiently  large  battery  capacity.  Extensions  to  single-hop  and  multihop  networks  are  also  presented. 
Such  policies  are  particularly  suitable  for  satellite  downlinks  or  sensor  networks.  In  the  second  part  of  this  work,  we  investigated 
performance  unfairness  (in  terms  of  throughput  and  delay)  on  the  Medium  Access  Control  (MAC)  mechanism  of  802.11  standard 
in  the  presence  of  hidden  terminals.  To  eliminate  the  impacts  from  the  physical  layer  (PHY ),  we  assume  good  channel.  In  this 
case,  we  may  intuitively  expect  that  all  stations  have  equal  success  probabilities  in  their  transmissions.  This  intuition  is  from  the 
random  access  scheme  of  Carrier  Sense  Multiple  Access  with  Collision  Avoidance  (CSMA/CA)  which  is  employed  by  802.11 
MAC  protocol.  However,  we  will  show  that  the  intuition  is  not  true  due  to  hidden  terminals.  Instead,  the  following  fact  is  revealed 
in  this  work:  The  success  probability  of  a  transmission  is  location-dependent.  More  specifically,  the  nodes  far  from  the  access 
point  see  more  hidden  terminals  than  those  close  to  the  access  point,  so  they  experience  more  collisions  and  thus  smaller  success 
probabilities.  We  build  a  model  to  analyze  the  throughput  and  collision  probability  for  nodes  at  different  locations,  validate  them 
via  simulations  and  compare  with  the  measurement  results  from  experiments. 
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I.  Introduction 

In  the  concept  of  that  project  we  investigate  the  challenges  of  wireless  networks  design  with  focus  on  energy  management 
on  battery  dependent  devices  and  the  impact  of  hidden  terminals  on  their  operational  efficiency.  This  study  provides  ideas 
to  explore  and  develop  through  that  work.  The  report  is  organized  in  two  parts.  The  fisrt  one  is  affiliated  with  the  energy 
managment  on  limited-energy  devices  and  the  latter  with  the  operational  cost  of  hidden  terminals. 

Efficient  energy  management  is  a  crucial  component  of  wireless  network  design,  since  it  can  lead  to  increased  throughput  and 
network  lifetime.  The  latter  concept,  which  has  numerous  application-dependent  definitions,  is  meaningful  for  battery  operated 
devices  that  do  not  have  energy-harvesting  capabilities  (i.  e.  they  cannot  recharge  themselves  from  ambient  sources)  and, 
hence,  become  inactive  once  they  run  out  of  energy.  On  the  other  hand,  there  exist  applications  where  the  wireless  transmitters 
can  replenish  their  batteries.  Two  common  examples  are  solar-paneled  satellites  (where  the  recharge  process  depends  on  the 
satellite’s  exposure  to  sunlight  during  its  orbit  and  can  be  determined  a  priori  with  high  confidence)  and  sensor  networks  [5] 
(which  harvest  solar  or  wind  energy  so  that  the  recharge  process  depends  on  atmospheric  conditions  and  is  better  modeled  as  a 
stochastic  process).  Such  rechargeable  systems  are  usually  regarded  as  having  infinite  lifetime,1  so  that  long-term  performance 
metrics  become  appropriate. 

This  document  initially  considers  the  control  of  a  wireless  downlink  operating  in  discrete  time  under  rechargeable  batteries, 
and  later  generalizes  the  analysis  to  single-hop  and  multihop  networks.  The  relevant  literature  has  greatly  expanded  in  recent 
years,  with  most  works  being  based  on  a  dynamic  programming  (DP)  and/or  Markov  decision  process  (MDP)  approach.  Finite 
horizon  control  problems  are  studied  in  [7],  for  a  single  satellite  downlink  subject  to  stochastic  power  demands  and  rewards 
(although  no  packet  dynamics  are  included),  and  [8],  for  a  multihop  network  where  the  nodes  have  knowledge  of  the  future 
short-term  recharge  process  (an  assumption  we  dispense  with  in  this  work).  The  model  in  [8]  is  inspired  by  the  virtual  circuit 
concept  and,  as  such,  performs  resource  allocation  on  a  one-shot  basis  for  each  accepted  service  request  (the  latter  is  described 
by  the  source  and  destination  nodes  as  well  as  the  associated  gained  revenue).  Specifically,  a  policy  is  developed  that,  for  each 
accepted  request,  computes  an  appropriate  energy-weighted  shortest  path  from  the  source  to  the  destination  node  of  the  request 
and  simultaneously  reduces  the  energy  of  all  nodes  lying  on  the  selected  path  by  the  required  cost  (provided  that  the  gained 
revenue  is  larger  than  the  expended  cost).  Since  the  policy  drops  any  requests  that  cannot  be  immediately  served,  queueing 
effects  are  ignored.  Additionally,  the  link-based  energy  costs  do  not  depend  on  channel  variations. 

In  [9],  a  rechargeable  group  of  cells  under  realistic  battery  fatigue  is  examined,  where  the  objective  is  maximization  of  the 
energy  delivered  from  the  battery.  Open  and  closed  loop  policies  for  finite  and  infinite  horizon  control  also  appear  in  [10]. 
Especially  for  sensor  networks,  [11]  examines  a  scenario  where  the  derived  utility  (namely,  probability  of  event  detection) 
depends  only  on  the  number  of  active  sensors.  The  sensors  are  either  recharging  or  transmitting  (but  not  both)  and,  once 
completely  drained,  can  only  activate  themselves  when  fully  recharged.  An  upper  bound  for  any  feasible  policy  is  derived  and 
distributed  threshold  policies  are  proposed  with  a  multiplicative  guaranteed  bound  of  1/2  w.  r.  t.  the  optimal  policy.  In  [12], 
the  sensors  are  allowed  to  be  activated  even  when  partially  recharged  and  an  asymptotically  optimal  (w.  r.  t.  battery  capacity) 
policy  is  proposed  for  Poisson  and  exponential  recharge/discharge  processes,  respectively.  Although  the  last  two  works  don’t 
use  DP  (and  hence  scale  well),  the  analytically  derived  bounds  hold  for  a  rather  ideal  setting.  Finally,  [13]  focuses  on  the 
temporal  correlations  between  the  events  sensed  by  a  single  sensor  and  studies,  in  terms  of  an  MDP,  the  structure  of  the  optimal 
policy  under  various  observability  conditions.  Efficient  suboptimal  policies  are  also  proposed  and  numerically  evaluated  for 
multiple  node  networks  in  [14].  References  [11  ]— [  14]  focus  exclusively  on  the  sensing  aspect  of  sensor  networks  rather  than 
the  network  flow  of  information  (i.  e.  what  happens  once  an  event  is  detected),  which  is  the  main  focus  of  this  document. 

Our  model  is  distinct  from  the  aforementioned  works  and  is  directly  influenced  by  the  cross-layer  stochastic  optimization 
framework  of  [15],  [16],  which,  in  turn,  was  inspired  by  [17],  This  framework  was  applied  in  [18],  which  proposed  optimal 
adaptive  backpressure  policies  (collectively  referred  to  as  ABP)  for  non-rechargeable  power-constrained  networks  under  long¬ 
term  average  constraints,  and  introduced  the  concept  of  virtual  queues  to  handle  the  latter,  while  [19]  addressed  fairness  issues 
in  a  similar  setting.  Unfortunately,  the  above  methodology  relies  crucially  on  the  fact  that  available  controls  depend  only  on 
current  channel  states  and  are  independent  of  prior  history,  and  this  condition  is  violated  in  our  model,  as  will  become  apparent 
later.  Hence,  this  approach  is  not  directly  applicable.  Our  rechargeable  model  explicitly  takes  into  account  wireless  channel 
variations  and  packet  dynamics  and  aims  in  maximizing  a  suitably  defined  measure  of  user  satisfaction,  while  imposing  minimal 
assumptions  on  the  arrival,  recharge  and  channel  processes  (the  basic  assumption  is  that  these  processes  are  rate-convergent,  as 
defined  in  [15]).  Since  the  lifetime  of  rechargeable  systems  is  practically  infinite,  the  inclusion  of  packet  dynamics  forces  us 
to  study  the  stability  properties  of  any  policy  under  consideration.  An  adaptive  stabilizing  policy  is  proposed  with  guaranteed 
performance  bounds  that  imply  asymptotic  optimality  for  sufficiently  large  battery  capacities. 

This  report  is  structured  as  follows.  Subsection  II-A  contains  the  system  model  and  problem  statement  for  a  downlink 
scenario  and  also  presents  a  connection  with  an  infinite  battery  problem  which  offers  important  insight.  Subsection  II-B 
introduces  the  concept  of  virtual  queues  and  proposes  a  policy  for  a  rechargeable  battery.  Stability  and  performance  analysis 

’if  we  take  into  account  battery  fatigue  (i.  e.  the  fact  that  recharge  capability  deteriorates  as  the  number  of  recharge  cycles  increases),  the  lifetime  is.  strictly 
speaking,  finite.  However,  as  shown  in  [6],  proper  design  can  lead  to  a  lifetime  of  more  than  a  year,  for  continuous  operation  (and  even  more  for  typical  duty 
cycles),  which  can  be  considered  as  infinite  for  engineering  purposes. 
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for  this  policy  is  carried  out  in  detail.  The  gained  experience  is  used  in  Subsection  1I-C  to  propose  policies  for  single-hop  and 
multihop  networks,  with  the  latter  requiring  a  more  complex  analysis  (performance  bounds  are  provided  in  all  cases) 

As  a  predominant  wireless  local  area  network  (WLAN)  technology,  IEEE  802.11  [20]  has  defined  both  contention-free 
point  coordination  function  (PCF)  and  contention-based  distributed  coordination  function  (DCF)  for  medium  access  control 
(MAC).  Nonetheless,  only  the  DCF  protocol,  which  is  a  random  access  scheme  based  on  carrier  sensing  multiple  access  with 
collision  avoidance  (CSMA/CA),  has  been  widely  deployed  in  the  field.  The  DCF  protocol  has  further  incorporated  a  ready-to- 
send/clear-to-send  (RTS/CTS)  handshake  mechanism  to  mitigate  the  hidden  terminal  problem ,  which  causes  great  performance 
degradation  if  left  unattended  [21], 

Most  research  on  the  hidden  terminal  problem  has  been  focused  on  the  performance  degradation  to  the  overall  network 
[22]-[23],[24],  This  work,  however,  focuses  on  the  level  of  the  degradation  imposed  on  individual  nodes.  We  find  that  the  level 
of  performance  degradation  for  a  node  is  location-dependent.  More  specifically,  it  depends  on  the  distance  from  the  access 
point  (AP).  As  we  will  see  from  the  measurement  results  in  section  III-F,  stations  at  different  distances  from  the  AP  experience 
significantly  different  collisions  probabilities  and  consequently  achieve  disparate  throughput.  Similar  observations  are  made 
from  our  analysis  and  simulations  (section  III-D).  In  a  16-node  WLAN,  we  observe  that  the  throughput  ratio  between  a  node 
close  to  the  AP  and  a  node  at  the  edge  of  the  network  ranges  from  three  times  to  as  much  as  five  times.  In  order  to  establish 
a  solid  theoretical  foundation,  we  propose  an  analytical  model  for  WLANs  in  the  presence  of  hidden  terminals.  To  further 
demonstrate  the  performance  unfairness  in  practice,  we  further  report  results  from  experiments  carried  out  in  real  environments. 

We  will  mainly  focus  on  the  scenario  with  RTS/CTS  handshake  in  our  analytical  model,  since  the  measurement  results  to 
be  reported  in  section  III-F  corroborate  that  WiFi  network  performance  in  the  presence  of  hidden  nodes  is  much  worse  when 
RTS/CTS  is  turned  off  as  compared  to  the  case  where  RTS/CTS  is  switched  on. 

The  rest  of  the  second  part  will  be  organized  as  follows.  Subsection  III-G  briefly  summarizes  the  related  work  in  the 
literature.  Subsection  III-A  describes  the  hidden  node  problem  and  the  resulting  performance  unfairness.  Subection  III-B  builds 
an  analytical  model  for  a  802.11  WLAN.  Subsection  III-D  validates  the  model  with  simulations.  Subsection  III-F  presents  the 
measurement  results  from  our  testbed.  Finally,  the  Appendix  provides  the  derivations  of  the  hidden  areas  and  Eq.  107. 

II.  Asymptotically  optimal  control  of  wireless  networks  with  rechargeable  batteries 
A.  System  model  and  problem  statement 

As  stated  in  the  Introduction,  the  proposed  policy  applies  to  generic  multihop  networks  under  arbitrary  rate-convergent 
arrival,  channel  state  and  (energy)  recharge  processes.  However,  in  order  to  simplify  the  discussion  and  gain  essential  insight 
into  the  operation  of  the  policy,  we  initially  present  the  analysis  for  a  downlink  scenario  with  iid  arrival,  channel  and  recharge 
processes  and,  in  a  later  section,  describe  the  modifications  needed  for  the  general  case.  Non-iid  processes  can  be  handled  by 
the  T-slot  analysis  [16]  of  rate-convergent  processes  and  are  not  covered  here. 

We  consider  a  time-slotted  system  where  slot  t  corresponds  to  the  time  interval  [t  t  +  1).  There  exists  a  single  base  node 
that  transmits  to  L  users  (i.  e.  there  exist  L  wireless  downlinks2).  The  link  channels  are  time-varying  so  that  we  denote  with 
S(t)  the  channel  state  at  slot  t  and  assume  it  is  iid  distributed  over  a  finite  set  S,  i.  e.  Sit)  £  S  for  all  t.  We  also  define 
7 is  =  Pr  (S(t)  =  s),  where,  with  no  loss  of  generality,  it  holds  ng  >0  Vs.  Channel  conditions  remain  constant  for  the 
duration  of  each  slot  but  may  change  at  slot  boundaries.  The  base  node  has  a  distinct  traffic  stream  for  each  downlink  so  that 
Afit)  stands  for  the  number  of  exogenously  generated  bits  during  slot  t  that  must  be  transmitted  to  (downlink)  user  l.  These 
bits  enter  the  transport  layer  and  are  stored,  in  a  FIFO  manner,  in  an  external  queue  awaiting  admission  into  the  network  layer 
and  subsequent  transmission.  Admission  control  is  necessary  for  the  case  of  heavy  traffic  that  exceeds  the  network  capacity. 
When  the  bits  are  admitted  into  the  network  layer,  they  are  first  stored  in  an  internal  “network”  queue,  again  in  FIFO  fashion, 
until  transmission  occurs.  Let  Vfit),  Ufit)  be,  respectively,  the  number  of  bits  stored  at  time  t  in  the  external/internal  queues 
of  the  base  node  and  destined  for  user  l  (through  link  l).3  Denote  with  /?/  ,„(/)  the  number  of  bits  moved  from  the  external 
to  the  internal  queue  (i.  e.  admitted  into  the  network)  of  link  l  at  time  t,  and  define4  Rin(t)  =  Supply/demand 

constraints  clearly  require  Runfi)  <  Vfit)  +  Afit)  for  all  l,t.  It  is  further  assumed  that  each  exogenous  arrival  process  Afit) 
has  a  deterministic  upper  bound  Ai  (i.  e.  Afit)  <  Ai  fit)  and  is  iid  distributed  with  expected  value  A/.  We  will  write  at  times 
A={At)^ i  and  A  4  (A 

The  base  node  is  equipped  with  a  rechargeable  battery  of  maximal  energy  Emax.  The  battery  level  at  time  t  is  denoted  as 
E(t).  The  battery  energy  is  depleted  due  to  link  transmissions  but  is  also  replenished  due  to  a  recharge  process.  Specifically, 
we  denote  with  B(t)  the  amount  of  replenished  energy  during  slot  t ,  where  B(t)  is  assumed  bounded5  (i.  e.  B{t)  <  B  Vt) 
and  iid  distributed  with  an  expected  value  of  B.  At  the  beginning  of  each  time  slot  t,  the  network  controller  chooses  a  power 
vector  P{t)  =  (Pi(t))f=1,  where  Pi(t)  is  the  selected  transmission  power  in  link  l  during  slot  t.  Available  transmission  power 

2in  the  following,  the  terms  user  l  and  link  /  will  be  used  interchangeably,  since  there  exists  a  1-1  correspondence  between  them. 

3  for  economy  of  expression,  we  will  refer  to  Ui(t)  as  the  internal  queue  of  link  /,  and  similarly  for  Vi(t). 

4we  hereafter  use  bold  face  to  denote  vector  quantities.  The  vector’s  dimension  will  be  apparent  from  context. 

5  the  deterministic  bound  assumptions  on  Ai  (t)  and  B(t)  can  be  replaced  by  the  weaker  conditions  of  finite  second  moments,  which  leads  to  slightly  modified 
performance  results  (although  the  proposed  policy  is  unaffected).  We  choose  to  stay  with  the  former  assumption  in  order  not  to  obscure  the  discussion. 
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vectors  may  be  channel-state  dependent,  i.  e.  P(t)  £  Vs  whenever  S(J)  =  s,  where  Vs  is  a  finite  set6  for  all  s  £  S.  The 
only  constraints  imposed  on  Vs  are  that  for  all  s  £  S  it  holds  0  £  Vs  and  YI1L1P1  —  ^  Vp  £E  'Ps,  where  P  <  Emax.  The 
first  constraint  states  that  the  system  may  always  select  to  decline  transmission  while  the  second  constraint  models  hardware 
limitations  or  standard  regulations.  The  existence  of  the  finite-energy  battery  further  restricts  the  available  power  vectors  for 
slot  t  by  the  natural  condition  J2t=iPi  —  -E(i)  Vp  £  Ps^ty 

To  facilitate  analysis,  it  is  assumed  that  all  values  of  the  recharge  process  B(t )  as  well  as  all  members  of  Vs  are  integer 
multiples  of  an  arbitrary  constant  (i.  e.  they  are  quantized),  so  that  E(t)  effectively  takes  values  in  a  countable  set.  For  a  given 
state  S(i)  and  selected  power  P(t),  the  transmission  rate  p(i)  =  (pi{t))f. =1  in  slot  t  (i.  e.  the  number  of  bits  that  can  be 
transmitted  in  each  link)  is  upped  bounded,  component-wise,  by  a  vector  function  c(S(t),P(t))  that  satisfies  the  following 
properties 

•  it  holds  ci  (,s.  p)  =  0  for  any  s,  p  such  that  pi  =  0.  A  consequence  of  the  previous  statement  is  the  fact  that  c  f.s.  0)  =  0. 

•  for  any  ,s.  p,  and  for  all  l,  it  holds  c/  (,s.  p)  <  q  f.s.  p)  where  fp,  =  iikpk  and  h;/i:  is  Kronecker’s  delta.  The  function 
ci  f.s.  p)  is  non-decreasing,  differentiable  and  concave  w.  r.  t.  pi.  Interpreting  the  first  condition,  it  is  equivalent  to  saying 
that  the  maximum  rate  of  link  l  decreases  (for  a  fixed  pi )  as  the  other  links  are  assigned  non-zero  power,  which  follows 
from  interference  properties.  The  second  condition  is  also  typical  of  most  rate  functions  and  appears  often  in  literature. 

•  it  holds  ^  (s,  0)  <  oo  for  all  l  and  s. 

Since  all  sets  encountered  so  far  are  finite,  we  define  the  bound7  cj  =  ma xs&s,peVs  ci(s>P)- 

Under  the  previous  assumptions  and  notation,  the  queues  V(t )  =  U(t)  =  and  the  battery  level  E(t ) 

evolve  as8 


Vi{t  +  1)  =  Vi{t)  +  Mi)  -  (1) 

Ui{t  +  1)  =  [Ui{t)  -  pi  {S(t)M{t))}+  +  Ri,in(t),  (2) 

E(t  +  1)  =  min  ^ E(t )  -  ^  Fj  ( t )  +  B(t) 

?  Emax  j  ,  (3) 

where  [,t]+  =  max(i,  0),  subject  to  constraints 

+  V/,t,  (4) 

L 

P{t)£VS{ty  J^Piff)  <  min  ^E(t),P^  Vf,  (5) 

1=1 

0  <W(S(t))P(t))<q(S(t)1P(t))  VZ,t.  (6) 


It  can  be  argued  that  /q  (S(t),  P(t))  should  be  replaced  by  c;  (S(t),  P(t))  in  (2)  since,  intuitively,  there  is  no  benefit  in 
transmitting  fewer  bits  than  those  allowed  by  the  selected  power  (assuming  there  are  sufficient  bits  to  transmit  in  the  first 
place).  Although  this  argument  is  indeed  correct,  we  elect  to  keep  the  current  notation  and  reach  the  same  conclusion  through 
the  policy  specification  itself  rather  than  mere  intuition. 

Denote  with  Ri  (t)  the  number  of  bits  actually  transmitted  in  link  l  during  slot  t  so  that  the  corresponding  time-average  rate 
is  ^ 

f,(t)4l  ^E[fl,(r)].  (7) 

1  T= 0 

We  also  use  the  shorthand  r(t)  =  (rL(t))fi1  and  write  r(t)  =  j  E  [-R(t)]-  Each  user  l  derives  a  satisfaction  fi  ( fi(t )) 

based  on  its  current  time-average  rate  n(f),  where  the  utility  function  /;(•)  is  assumed  to  be  non-negative,  increasing, 
differentiable  and  concave.  We  also  impose  the  constraints  /;( 0)  =  0,  f[ (0)  <  oo.  The  total  system  satisfaction  g  (r(t)) 
at  time  t  is  g  ( r(t ))  =  fi  (T;(f)).  We  finally  make  the  following  natural 

Assumption  1:  At  time  t  =  0,  all  bit  queues  are  empty. 

No  assumption  is  made  regarding  the  initial  battery  level;  hence  /f(O)  can  take  any  value  in  [0  Emax\.  Loosely  speaking, 
a  decision  rule  is  a  procedure  for  selecting  the  control  variables  Rin(t),  P(t)  for  a  specific  time  slot  l  subject  to  all 
aforementioned  constraints,  while  a  policy  is  a  sequence  of  decision  rules  for  all  time  slots.  We  restrict  attention  to  policies 
that  stabilize  the  network  according  to  the  definition  of  [15].  Specifically,  a  network  with  a  composite  queue  U(t)  is  stable 
iff  limsup^oo  j  E[T)(r)]  <  oo  VI.  Note  that  the  notion  of  stability  considers  only  the  internal  queue  U(t)  since  the 

6although  in  our  case  it  holds  Vs  =  V  Vs  £  5,  since  there  is  no  reason  for  available  power  choices  to  depend  on  the  channel  state,  there  exist  scenarios 
where  the  available  powers  depend  on  a  properly  defined  state  space  (which  may  include  more  components  than  channel  conditions  only).  These  cases  also 
fall  under  the  considered  model. 

7  unless  otherwise  noted,  the  accent '  will  always  denote  an  upper  bound. 

sunless  otherwise  noted,  the  indices  l,  s.  p  will  hereafter  range,  respectively,  over  the  sets  {1. ....  L},  5,  Vs  (e.  g.  we  write  V  s  instead  of  the  formally 
correct  Vs  £  5,  etc). 
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external  queue  Vft)  is,  by  definition,  not  part  of  the  network  layer  (in  fact,  for  heavy  arrival  traffic  that  exceeds  network 
capacity,  V ( t )  must  grow  without  bound).  Hence,  we  state  our  problem  as  follows 

Problem  1:  For  the  downlink  scenario  described  by  the  queuing  evolutions  of  (l)—(3)  and  the  constraints  of  (4),  (5),  find 
a  stabilizing  policy  that  maximizes  liminft^oo  g  (f  (/)).  The  optimization  is  performed  over  the  set  of  all  stabilizing  policies, 
including  those  with  perfect  knowledge  of  future  events.  Denote  the  optimal  objective  value  as  g*e. 

Determination  of  the  optimal  policy  in  Problem  1  is  very  challenging,  so  we  instead  seek  a  policy  with  a  system  utility  that 
is  close  to  the  optimal  one.  Since  P ,  B  are  considered  to  be  system  parameters,  we  can  fix  their  values  and  restate  the  problem 
as 

Problem  2:  Under  the  same  assumptions  as  in  Problem  1,  find  a  policy  that  for  any  e  >  0  achieves  an  objective  value  no 
less  than  g*e  —  e,  provided  that  Emax  >  E*fe)  for  a  sufficiently  large  E*{e). 

Since  g*e  clearly  depends  on  Emax,  a  better  notation  would  be  g*e  ( Ernax )  but  we  elect  to  simplify  notation  and  keep  the 
dependence  implicit.  Problems  1,  2  are  essentially  sequential  decision  problem  that  can,  in  principle,  be  attacked  with  dynamic 
programming  (DP)  and/or  Markov  chain  techniques.  However,  these  techniques  are  impractical  since  they  suffer  from  the 
dimensionality  curse  of  DP  and  require  extensive  knowledge  of  system  parameters,  e.  g.  ns,  A,  which  may  not  be  available 
to  the  network  controller.  On  the  other  hand,  the  Lyapunov  drift  framework  of  [  16]— [  19]  assumes  that  at  any  state  s,  one  may 
always  choose  any  of  the  available  controls  in  Vs-  However,  this  is  not  the  case  in  our  model,  as  evidenced  from  (5),  which 
implies  that  when  Eft)  <  P  the  available  powers  Pit)  depend  explicitly  on  the  battery  level  Eft).  Hence,  this  approach  is 
not  directly  applicable  to  our  problem.  Nevertheless,  it  suggests  the  existence  of  a  modified  policy  that  solves  Problem  2  when 
Emax  V>  B,  P.  The  intuition  behind  the  last  statement  is  explained  below. 

1 )  Some  intuitive  remarks:  Equations  (3),  (5)  imply 

t- 1  L  t- 1  t- 1 

E  E Pi  -  Emax  +  '^2b(t)  ^ lim  sup  7  E  E 

where  the  first  relation  expresses  conservation  of  energy  and  the  second  one  follows  from  the  first  by  taking  expectations, 
dividing  with  t  and  taking  a  limsup  as  t  — >  oc.  Eq.  (8)  implies  that  any  policy  (stabilizing  or  not)  acting  on  the  rechargeable 
battery  satisfies  an  average  power  constraint  of  B,  which  in  turn  allows  us  to  perform  the  following  “thought  experiment”. 
Consider  the  same  downlink  problem  as  above  and  replace  the  rechargeable  battery  with  an  infinite  capacity  battery,  which 
essentially  removes  (3)  and  the  Eft)  constraint  in  (5).  Denote  with  g*vg  the  maximum  system  utility  achieved  over  all  policies 
acting  on  the  infinite  battery  with  average  power  constraint  B  (we  collectively  refer  to  these  as  “average”  policies).  This  scenario 
is  treatable  by  the  methodology  in  [18],  which  proposes  an  ABP-based  adaptive  policy  whose  performance  can  approach  g*vg 
arbitrarily  close.  Since  it  clearly  holds  gre  <  g*vg,  Problem  2  is  solved  if  we  can  find  a  policy  (for  the  rechargeable  battery) 
that  performs  better  than  g*vg  —  e. 

At  this  point,  and  since  the  finite  energy  content  of  the  battery  affects  decisions  only  when  Eft)  <  P,  one  might  be  tempted 
to  directly  apply  the  ABP  average  policy  on  the  rechargeable  battery  (obviously,  when  ABP  selects  a  power  that  exceeds  the 
currently  available  energy,  a  best-effort  action  is  taken).  After  all,  assuming  E( 0)  =  Emax  P  and  applying  the  ABP  average 
policy,  it  would  take  a  very  long  time  for  the  battery  level  to  drop  below  P,  where  ABP  is  non-optimal.  Hence,  it  can  be 
argued  that  such  a  policy  is  optimal  “most  of  the  time”  and  the  asymptotic  optimality  is  essentially  “proved”.  The  previous 
reasoning  is  wrong  in  an  infinite  horizon  model  in  that  it  neglects  what  happens  after  the  battery  drops  below  P.  Specifically, 
if  ABP  results  in  the  battery  level  oscillating  around  P,  or,  worse,  staying  under  P  for  extended  intervals  of  time,  then  even 
though  ABP  is  optimal  for  the  time  interval  before  the  battery  dropped  below  P  for  the  first  time,  it  becomes  non-optimal 
for  long  intervals  of  time  after  that.  If  this  occurs  for  many  sample  paths,  it  is  not  evident  that  the  suggested  policy  performs 
close  to  optimal. 

The  previous  paragraph  effectively  described  a  mechanism  through  which  any  policy  acting  on  the  rechargeable  battery 
suffers  a  performance  loss  with  respect  to  the  ABP  average  policy.  The  crucial  point  is  that  it  is  not  a  priori  known  (and 
neither  is  it  intuitively  obvious)  whether  this  loss  can  be  made  arbitrarily  small.  Another  mechanism  that  leads  to  performance 
degradation  is  the  loss  of  recharged  energy  due  to  the  Emax  cap.  Specifically,  for  a  slot  t,  it  may  hold  Eft)  >  Emax  ~  B  and 
for  a  poor  channel  state  the  controller  elects  to  transmit  with  low  power  (so  that  power  is  withheld  for  better  channel  states 
which  can  offer  increased  utility)  and  when  the  battery  is  recharged,  the  replenishment  is  too  high  so  that  some  energy  is  lost 
due  to  the  Emax  cap.  Obviously,  if  the  controller  knew  in  advance  the  exact  amount  of  energy  replenishment,  it  could  elect 
to  increase  the  selected  transmitted  power  by  the  amount  of  overflow  above  Emax.  This  would  have  the  effect  that  the  battery 
energy  would  still  be  the  same  at  the  end  of  the  time  slot  but  now  the  user  has  received  a  higher  satisfaction.  Clearly,  the  ABP 
average  policy  is  immune  to  this  effect,  since  there  is  no  battery  to  overflow  in  the  first  place. 

The  previous  observations  indicate  that  a  policy  closely  resembling  the  ABP  average  policy  when  Eft)  >  P  and  under 
which  Pr(.E(i)  <  P)  and  Pr (Emax  —  B  <  Eft)  <  Emax)  are  small  should  perform  nearly  optimally.  This  is  quantified  next. 


.1=1 


<  B, 
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B.  Performance  analysis 
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1)  Virtual  queues  and  policy  specification:  We  now  focus  on  the  rechargeable  battery  setting  and,  in  the  spirit  of  [15], 
introduce  a  virtual  queue  for  each  linear  long-term  constraint.  Two  such  constraints  exist  in  our  model,  the  first  one  due  to  the 
finite  arrivals,  as  modeled  in  (1),  and  the  second  one  due  to  (8).  Hence,  we  define  virtual  queues  Y(t)  =  (Yi(t))jL1  and  D(t ) 
which  evolve  as 

Yl(t+l)  =  [Yl(t)-Rl,in(t)]+ (9) 

L 

D(t  +  1)  =  [D(t)  -  (1  -  S)B(t)}+  +  J2  (10) 

i= i 

for  some  0  <  <5  <  1,  where  7 fit)  is  an  auxiliary  process  introduced  for  mathematical  convenience.  Since  i?,ra(f), 7(f)  are 
determined  by  the  policy,  we  can  impose  arbitrary  bounds  on  them.  Specifically,  we  require  Ri^n(t),  Vi(t)  <  Ai  V l.  The  virtual 
queues  are  constructed  in  such  a  way  that  any  policy  that  stabilizes  them  also  satisfies  the  appropriate  long-term  constraints 
(this  novel  insight  was  introduced  in  [18]).  Eq.  (10)  is  different  from  the  standard  approach  of  [18],  which  would  replace 
(1  —  S)B(t)  with  B,  since  the  latter  is  the  actual  constraint  to  be  satisfied.  This  modification  is  an  essential  ingredient  of  our 
approach  and  has  important  consequences,  as  will  become  apparent  later.  Fig.  1  represents  the  interconnections  between  the 
various  physical  and  virtual  queues,  as  defined  by  their  evolutions  in  (l)-(3)  and  (9),  (10).  The  dotted  line  in  Fig.  1  is  used 


B(t) 


Eft) 


energy  quanta ; 
overflow 


E,  P,  ft) 


D(t) 


(l-S)B(t) 
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Fig.  1.  Queue  connections  in  the  downlink  scenario  with  rechargeable  battery. 
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to  examine  the  aggregate  queue  of  E(t),  D(t).  Since  the  compound  queue  has  an  arrival  rate  of  B{t)  and  a  service  rate  of  at 
most  (1  —  S)B(t),  it  is  unstable,9  a  fact  that  will  be  of  crucial  importance  later.  In  foresight,  it  will  be  used  to  show  that  the 
queue  E{t)  is  sufficiently  loaded  most  of  the  time  so  that  Pr(i?(t)  <  P)  is  small. 

To  assist  the  reader,  the  policy  description  is  given  first,  followed  by  the  stability  and  performance  analysis  in  later  sections. 
Specifically,  we  propose  the  following 

Downlink  rechargeable  adaptive  backpressure  policy  (DRABP) 

1)  at  the  beginning  of  slot  t,  observe  queues  U(t),Y(t),  V{t)  and  select  Rin(t)  bits  for  admission  into  the  network  layer 
according  to 


L 

Rin(t)  =  argmax  ^  {pYfit)  -  Ufit))  n 

1=1 

s.t.  0  <  n  <  min  (vfit)  +  VI, 

where  77  is  a  tuning  parameter  used  for  balancing  the  congestion  among  the  physical  and  virtual  queues. 


(11) 


9in  principle,  the  aggregate  queue  would  still  be  unstable  if  we  replaced  (1  —  S)B(t)  with  (1  —  ^)5  in  (10),  though  the  analysis  would  be  more  complicated. 
Also,  B  may  not  be  known  to  the  controller  while  B{t )  can  be  easily  computed. 


2)  select  7  (t)  as  the  solution  to  the  following  problem 


7  (t)  =  argmax  E  [V fi  (ji)  - 
1=1 

s.t.  0<7 1  <  Ai  V/, 

3)  observe  S(t)  and  select  P{t)  as  the  solution  to  the  following  problem 

L  L 

P{t)  =  argmax  ^  [Ui(t)fii  (S(t),p)  -  D(t)pi]  =  argmax  ^  [Ui(t)ct  (S(t),p)  -  D(t)pi\ 


(12) 


1=1 


1=1 


L, 

sX  P e  VS(ty  E  Pi  <  min  (m,p) 


(13) 


1=1 


0  <  m(s(t),p)  <  ci(s(t),p)  vi, 

4)  update  queues  V(t),Y(t),U(t),  E(t),  D(t),  in  that  order,  according  to  the  appropriate  evolution  equations. 

The  previous  policy  will  occasionally  be  denoted  as  DRABP(<5, 77,  V)  to  emphasize  its  dependence  on  these  two  parameters. 
Its  difference  from  the  policy  of  [15]  lies  in  the  presence  of  the  term  E(t)  in  the  constraint  of  (13).  As  previously  mentioned, 
this  term  causes  the  analysis  of  [15]  to  become  inapplicable  to  our  model  so  that  a  new  approach  required.  The  following 
observations  can  be  made 

•  Eq.  (11)  accepts  a  bang-bang  solution  of  the  form 


0 


lin  (vfit)  +Ai{t),A^  if  Ui(t)  <  rjYi{t), 


otherwise, 


(14) 


•  the  concave  maximization  problem  in  (12)  is  separable  and  has  a  closed  form  solution  if  (//)-1(7z)  is  analytically  known 
(the  -1  superscript  denotes  functional  inverse). 

•  the  problem  in  (13)  is  the  most  computationally  expensive  step  of  DRABP  and,  depending  on  the  form  of  c/  (•,  •)  (which 
in  turn  depends  on  network  interference  properties),  may  not  be  separable. 

Note  that  DRABP  requires  no  knowledge  of  system  parameters  (e.  g.  7 rg,  A)  and  the  intuitive  rule  of  transmitting  with  peak 
rate  for  a  giver  power  arose  naturally  through  (13).  It  remains  to  examine  the  stability  and  performance  properties  of  the  above 
policy.  The  latter  requires  some  additional  notation  and  results  to  be  introduced. 

2)  Some  preliminary  results:  The  following  lemma,  which  is  proved  in  [15],  characterizes  the  optimal  solution  to  the  infinite 
battery  with  average  power  constraint  and,  as  stated  in  Section  II-A1,  provides  an  upper  bound  for  any  rechargeable  policy 

Lemma  1:  No  downlink  stabilizing  policy  acting  on  an  infinite  capacity  battery  under  an  average  power  constraint  of  B 
can  achieve  an  objective  value  larger  than 


3a„ff=max  ^fM) 

1=1 

s.t.  r  €  1Z, 

0  <  r  <  A, 


(15) 


where 


R  =  |  r  £  l1  :  3  7Tp>0  s.t.  0  <  7  <  E  E«(-  ,p)TTpTTS  VI,  E  E  Y.p^p718  ^  E  *«  =  1  v* 


' p 


sespeVs 


sespePs  l= 1 


p&vs 


(!6) 

Clearly,  the  quantity  71 p  in  the  definition  of  'R.  is  a  discrete  pdf  on  the  set  Vs-  In  [15],  it  is  interpreted  as  the  probability  with 
which  a  randomized  policy  would  select  power  p  when  the  current  channel  state  is  .s. 

The  starting  point  for  deriving  a  performance  bound  for  any  rechargeable  policy  is  the  following  variation  on  a  known  result 
[15,  Lemma  5.3  and  Theorem  5.4] 

Lemma  2:  Consider  a  concave  function  gf)  and  stochastic  processes  Q(t ),  X{t),  Z(t).  If  there  exists  a  non-negative  function 
Ly(X(t))  (referred  to  as  a  Lyapunov  function)  such  that  E[Ly  (Jt(0))]  <  00  and  the  following  relation  is  satisfied  for  all  t 
and  some  constants  C,  V  >  0 

A  (X(t))  -  CE  [9  (Z(t))\ X(t)\  <  C  —  Vg*  +  Q(t),  (17) 


it  then  holds 


liminf  (/  >  g*  -  °  +  ® , 

t—>oo  v 


(18) 
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where  z(t)  is  the  time-average  of  Z(t),  A  (X(t))  =  E  [Ly  (X(t  +  1))  —  Ly  ( X(t ))  \X(t)]  and 


t-i 

Q  =  limsup  -  ^E[Q(j)]. 


See  Appendix  V-A  for  the  proof.  A  variant  of  Lemma  2  can  also  be  used  to  prove  stability  of  any  policy  that  satisfies  a 
modified  version  of  (17);  however,  we  will  actually  be  able  to  prove  a  stronger  result  (namely  deterministic  queue  bounds)  by 
examining  the  policy  specification  itself,  so  this  variant  is  omitted. 

We  also  define,  for  0  <  S  <  1,  the  set 

72.5  =  j  r  G  RL  :  3  7Tp  >  0  s.t.  22  PiKp-rts  <  (1  -  6)B,  0  <  n  <2^ci(s,p)  tt^t ts  V  l,  ^TTp  =  lVs|,  (20) 


as  the  set  of  rates  that  can  be  stabilized  by  a  randomized  policy  subject  to  an  average  power  constraint  of  (1  —  5)B,  and  the 
optimization  problem 

L 

9*g  =  max  ^/;(n) 

1=1  (21) 

s.t.  0  <  r  <  A, 

r  G  Tig. 

Since  0  G  Vs  V  s,  it  follows  that  Tig  f  0.  Also,  Tig  has  the  following  properties 

•  it  is  convex  and  compact 

•  r  €  Tig  implies  r\  &Tlg  V  r\  <  r. 

These  properties  are  proved  in  a  straightforward  manner  in  [15],  which  also  provides  the  following  continuity  result 

Hm  g*g=g*avg  (22) 

3)  Stability  and  performance  properties  of  DRABP:  The  following  result,  proved  in  Appendix  V-B,  provides  all  necessary 
information  for  the  stability  properties  of  DRABP 

Lemma  3:  Under  DRABP,  queues  Y(t),U(t),D(t )  are  deterministically  bounded  for  all  t  as  follows 

D(t)<— +  4^}]  Vi, 

7? 


Ui(t)  <  rjYt  +  Ai  =  Ui  Vi, 

D(t)  <  max  (UiCi)  +  P  = 
kkl  V  / 


P  =  D , 


where  C,  =  maxS6S  0,0). 

Hence,  we  have  actually  proved  a  much  stronger  result  (namely  deterministic  bounds)  than  stability  for  DRABP.  In  order  to 
derive  a  performance  bound  via  Lemma  2,  we  define  the  composite  queue  X(t)  =  ( U(t),Y(t),  D(t ))  and  the  Lyapunov 
function 

L 

Ly  (X(t))  =  £  (UlV)  +  vYt2(t))  +  D2(t)  (24) 

1  =  1 

From  (2),  (9),  (10)  and  Lemma  4.3  of  [15],  it  follows  after  a  little  algebra 

u?(t  +  1)  <  U2(t)  +  y2  ( S(t ),  P(i))  +  Rlin(t)  -  2 Ui(t)  \pi  ( S(t ),  P(t))  -  Rl>in(t)} ,  (25) 

Y2(t  + 1)  <  Y2(f)  +  R2in(t)  +  y2(t)  -  2Yi(t)  [ Ri,in(t )  -  (t)] ,  (26) 

D2(t  +  1)  <D2(t)  +  (l-S)2B2{t)  +  [22Pl^ )  -2D(*)  (l-Wt)-53^(t)  .  (27) 

\i= i  J  L  i=i 

Summing  (25),  (26)  over  l  and  using  all  the  imposed  bounds  results  in 


A  (X(t))  <22{f  +  (2r?  +  1)^i)  +B2  +  P2-  2E  22  -  Ri,in(t )]  X(t) 

1=1  L  i=i 

L  i  r  L 

-2r?E  22Y^{Rbin(t)-li(t)\  X(t)  -2D(t)E  (l-6)B(t)~22PlW  ' 

.  1=1  J  L  i=i 
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Exploiting  the  iid  nature  of  B(t),  which  implies  E  [B(t)\X(t)]  =  B,  subtracting  2FE  ^^;=1 /)  (7;(i))|  X(£) 
sides  and  rearranging  terms  yields 


from  both 


A(X(t))-2VE 


1  =  1 


X{t) 


<C  -  2E 


Y  Wfi  (7«W)  -  vYl(thl(t)} 


-  2E 


Y[W)H(S(t),P(t))-D(t)Pl(t)] 


X(t) 


1=1 


—  2  E 


X(t) 


YfoYiW-UiWRifrit) 


1=1 


X{t) 


(29) 


-  2(1  -  5)BD(t), 

where  C  =  B2  +  P2  +  J2t=i  +  (2r?  +  1  )Aifj .  The  last  equation  justifies,  in  retrospect,  the  specification  of  DRABP  since 
each  step  of  DRABP  is  chosen  so  as  to  minimize  the  overall  Lyapunov  drift  in  (29). 

Our  intention  is  to  bring  (29)  into  the  form  of  (17).  To  this  end,  we  need  to  bound  each  term  in  (29)  in  such  a  way  that  the 
policy  actions  (i.  e.  JP(f),  Rin(t))  are  removed  due  to  cancellations.  This  procedure  will  occupy  the  entire  current  section.  The 
following  remarks  will  be  useful.  For  a  given  X(t)  (which  implies  that  U(t),  D(t)  are  known)  and  S(t)  =  s ,  the  solution  to 
the  optimization  in  (13)  is  independent  from  E(t)  when  E(t)  >  P.  We  denote  this  solution  as  p*s  and  use  the  notation  P  ('.s.  e) 
for  the  solution  to  the  same  problem  when  E(t)  =  e  <  P.  Since  the  zero  power  vector  is  always  an  allowable  selection,  it 
follows  from  optimality  of  (13)  (recall  that  P{t)  is  selected  according  to  the  DRABP  policy) 

L  l 

Y [ m)ci (. s(t),p(t )) - D(t)p(t)}  > yuaa (s(t),o)  =  o  vi 

1=1  1=1 

L 

=>  Y  WA  (s’ P  (s’  e))  “  DA)pi  (s>  e)]  >  0  V  s,  e,  t. 

i=i 


(30) 


Consider  now  the  joint  probability  qs,e(t)  =  Pr  (S(t)  =  s,  E(t )  =  e|  X(t)),  when  DRABP  is  applied,  and  denote  with 
£{t)  the  set  of  values  that  E(t)  can  take  under  DRABP  (it  is  easy  to  show  inductively  that  £(t)  is  countable).  It  holds 


E 


YW)ci  (S(t),P(t))  -  D(t)Pi(t)\ 


1  =  1 


X(t) 


=  YYY  [Ui(t)ci  ( s ,  P(s ,  e))  -  D(t)Pi  (s,  e)]  qs,e(t) 

seSeeS(t)  i=i 


^YYY  [Ui(t)ci  (. s,p*s )  -  D(t)pla]  qsA*) 
seSees(t )  i=i 
e>P 

L  L 

=  Y  Y  Y  [uA)ci(s’P*s)  -D(t)pis\  QsAt)  ~  Y  Y  H  [t/i(i)c«(s>Pa) --D^Ks]  9s,e(0 

sgs  ees(t)  1=1  ses  eeS(t)  i=i 

e<P 

L  L 


(31) 


=  YY  [UA)cl  (s’P*s)  -  D(i)Pi,s ]  Ks  -  Y  Y  [UA)cl  AlPs))  -  D{t)Pi,a]  Pr  (-E(i)  <  P,  s(t) 

ses  i=i  ses  1=1 

L  L 

^YY  [Ui(t)ci(s,p*s)  -  D(t)p*hs ]  ns  ~Y^i  pr  (E(t)  <  P  |  J 


X 


ses  i=i  l=i 

where  the  first  inequality  is  due  to  (30),  the  fourth  line  is  due  to  the  independence  of  pg  from  e  (the  value  of  E(t))  and 
the  final  line  follows  from  the  bounds  on  U(t )  (due  to  Lemma  3)  and  C/  (s.p).  We  also  used  the  iid  property  to  write 
Y^ses  QsAt)  =  Pr  {S(t)  =  s|  X(t))  =  ns  in  the  transition  from  the  third  to  the  fourth  line. 

Pick  any  pdf  nip  acting  on  the  set  Vs-  We  have,  from  optimality  of  (13), 

L  L 

Y  [ Ui(t)ci  ( s,p*s )  -  Di(t)pls\  >  Y  [ Ui{t)ci  (, s,p )  -  D(t)pi]  V t,  s,  p,  (32) 

1=1  1  =  1 

so  that  multiplying  with  7Tp7Ts ,  summing  over  and  exploiting  the  pdf  property  7Tp  =  1  yields 

L  L 

Y  Y  Y  [UA)cl  AlPs)  -  D(t)Pi,s]  TTpTTs  >  Y  Y  Y  lUA)cl  (S’P)  -  D(t)pi]n^ns 

sespevs  1=1  sespevs  1=1 

L  L 

=>YY  WA'I  (s,P*s)  -  D{t)pls\  ns  >  Y  Y  Y  [Ui(t)ci  (s, p)  -  D(t)pi }  npns- 
sgs  1=1  s<ESpersi=i 


(33) 
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Now  pick  any  r  such  that  0  <  r  <  A,  whence  it  follows 


n 


At  (t)  —  <  At  (: t )  <  min 
A; 


+  Ai(t ),  Aij  V /,  t. 

Hence,  Ai(t)ri/\i  belongs  to  the  constraint  set  of  the  optimization  in  (11),  so  that 

L  L 

F  W(t)  -  u,{t))  Rl,in{t)  >  J2  WV)  -  UW)  Mt)  Y 


1=1 


1=1 


(34) 


(35) 


where  Ri^n(t)  is  selected  by  the  DRABP  policy.  The  last  expression  contains  the  random  variables  Ri,in(t),Ai(t)  so  that 
taking  conditional  expectations  upon  X  (t)  and  using  the  iid  nature  of  Ai  (t)  provides 


E 


F  -  Ui(t ))  Rl,in(t) 


1=1 


X{t) 


Tz'Eivm-WVrt  Vr:  0  <  r  <  A, 


(36) 


1=1 


since  K[Ai(t)\X (t)]  =  A /.  Also,  the  above  r  satisfies  0  <  77  <  Ai ,  so  that  optimality  of  (12)  implies 
L  L 

'52[Vfi(ji(t))-riYl(t)'yi(t)]>'^2[Vfi(ri)-riYi(t)ri]  Vr  :  0  <  r  <  A,  (37) 

1=1  i=i 

where  7 i(t),  which  is  selected  according  to  the  DRABP  policy,  is  a  random  variable.  Clearly,  a  similar  inequality  is  produced 
for  the  conditional  expectation  upon  X (t)  of  the  LHS  of  (37). 

Inserting  (37),  (36),  (33)  in  (29)  produces 


A{X(t))-2VE 


1=1 


X(t) 


<  C -2^(i//,(n)  -  vYi(t)n)  -2  J2(r)Yi(t)  -  u,(t))n 


1=1 


1=1 


(38) 


X 


-  2(1  -  S)BD{t). 


-2F  F  F  \Ui^)ci  (S>P)  -  D(t)pi\n^irs  +2^2ciUiPt  (^(f)  C  P 

seSpGPs  1=1  1=1 

Equation  (38)  holds  for  any  r  with  0  <  r  <  A  and  any  pdf  7 Tp.  Hence,  we  can  pick  r  to  be  the  maximizing  argument  of  (21) 
(i.  e.  the  vector  achieving  g%)  and  np  the  pdf  that  corresponds  to  it  according  to  (20).  This  creates  some  cancellations  in  (38) 
and,  through  the  definition  of  IZg,  strengthens  (38)  to 


A(X{t))-2VE 


F  fl  ('YiW) 


Ll=l 


X(t) 


L, 

<C~  2Vg*s  +  2F  C1U1  Pr  ^E{t) 


<  P 


1=1 


X 


Vi. 


A  direct  application  of  Lemma  2  now  provides 

L 


liminf  F/*  (71  (0)  >9*6- 

t.—>  OO  -4 - ' 


C1  Ef=i  ciUi„ 


1=1 


2V 


V 


limsup-  F?r  fp(j)  <  p)  , 

t—>  00  t  .  ^  7 


(39) 


(40) 


j=0 


E[Pr (E(t)  <  P\X(t))\  =  Pr (E(t)  <  P). 


4)  Performance  bound  of  DRABP:  Eq.  (40)  is  not  very  informative  since  our  original  intention  was  to  provide  a  bound  for 
the  liminf  of  1  fi  (n(i))  and  not  J2r=l  fi  (Tz(^))’  which  has  no  physical  meaning.  Additionally,  we  need  to  estimate  the 
limsup  appearing  in  (40).  The  first  issue  is  handled  through  the  following  Lemma,  which  essentially  exploits  the  stabilizing 
properties  of  DRABP.  The  result,  whose  proof  is  found  in  Appendix  V-C,  is  initially  stated  in  a  general  setting  and  then 
reduced  to  our  model 

Lemma  4:  Consider  any  compound  queue  Z(t)  (with  physical  or  virtual  components)  with  an  arrival  process  Ain(t)  and 
a  bounded  sendee  process  pout(t)  that  evolves  as  Z(t  +  1)  =  [Z(t)  —  /rOMt(f)]+  +  Ain(t),  with  E[Z’(0)]  <  00.  Denote  with 
p-outi t)  the  number  of  bits  actually  transmitted  at  slot  t  and  define  the  long-term  averages  dln (t),  pbout(t),  ptout(t).  Under 
any  stabilizing  policy,  it  holds 


liminf  g  ( ain(t ))  =  liminf  g  (p,out(f))  <  liminf  g  (p.out{t)) , 


(41) 


where  g  is  any  continuous  and  component-wise  increasing  function  (i.  e.  x  <  y  implies  g(x)  <  g(y)). 

Consider  now  the  queues  Y(t),  U(t).  These  queues  are  stable  (in  fact,  finite)  under  DRABP,  so  that  applying  the  above 
Lemma  w.  r.  t.  the  evolution  equations  (9),  (2)  and  setting  g(x)  =  J2iLi  fi(xi )  (this  g  clearly  satisfies  the  conditions  of  the 
Lemma)  results  in 

L  L  L  L 

liminf  F  fl  (vzW)  =  liminf  F  fi  (tfioW)  <  liminf  F  ft  (n,*n(i))  =  liminf  F  ft  (n(*)-) 


1=1 


1=1 


1=1 


1=1 


(42) 
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The  following  statement,  proved  in  Appendix  V-D,  provides  an  estimation  of  the  limsup  appearing  in  (40) 
Lemma  5:  Under  DRABP,  it  holds 


1  t-i  / o—l 

lim  sup  —  EPr  (E(j)<p)  <Pr  (J^B(k)< 


P  +  D  —  En 


3=0 


\k=0 


(43) 


where  cr  =  \Emax/P  -  1], 

Combining  Lemma  5  with  (42),  (40)  yields  the  main  result  for  the  downlink  case 
Theorem  1:  DRABP  stabilizes  the  system  and  achieves  a  performance  bound  of 

iimmf  Y  fl  (nW)  >9s~&  i~C2  Pr  ( Y  B(A)  <  )  (44) 

^°°  1=1  \k= 0  / 

where  C\  =  C / (2V)  and  Ci  —  'Yfd—i(ciUi)/V. 

An  examination  of  Lemma  3  reveals  that  [7j,  D  =  0(C),  C  =  0(1)  and  C\  =  O ( 1  / V ) .  Theorem  1  admits  the  following 
Corollary  1:  For  given  policy  parameters  V,  rj  >  0,  a  selection  of  Emax  such  that  Emax  >  P  +  D  implies  the  following 
performance  bound  for  DRABP 

L 

lim  inf  Y  fl  (fl  (*) )  >  9*s  ~  C t  •  (45) 

t — »00  -4 - ' 

1=1 


Since  C\  =  0(1 /V)  and  c/J  — +  g*  as  <5  — >  0,  the  above  RHS  can  approach  g*vg  arbitrarily  close  as  <5  — >  0  and  V  — >  oo 
(provided  that  it  holds  Emax  >  P  +  D),  which  implies  the  asymptotic  optimality  of  DRABP. 

Additionally,  a  slightly  more  complicated  asymptotic  result  can  be  established  for  the  regime  P  +  D  —  5oB  <  Emax  <  P+D 
(the  regime  Emax  >  P  +  D  is  handled  by  Corollary  1)  according  to  the  following 

Lemma  6:  For  given  policy  parameters  V,tp5  and  iid  recharge  process  Hit),  a  selection  of  Emax  s.t.  P  +  D  —  oSB  < 
Emax  <  P  +  D  implies  the  following  bound  for  DRABP 


^  0—1 


Pr  Y  ^ 


P  +  D  —  Ery 


<  exp 


\k=0 


cr  l 


where 


£(x)  =  sup  (Ox  +  9B  —  InE 
6  ' 


P  +  D  —  En 
aS 


—  B 


(46) 

(47) 


Due  to  the  iid  property,  any  value  for  t  can  be  used  in  the  last  expectation. 

Proof:  See  Appendix  V-E.  ■ 

The  qualitative  aspect  of  the  above  corollary  could,  in  retrospect,  be  derived  independently  through  the  following 
Lemma  7:  The  condition  Emax  >  P  +  D  implies  Eft)  +  Dft)  >  Emax  for  all  t  >  f  (where  t  =  inf{f  :  Eft)  =  Emax}  is 
the  first  time  the  battery  hits  Emax )  and  therefore  Eft)  >  P  for  all  t  >  f . 

Since  f  is  finite  w.  p.  1  (due  to  the  infinitely  often  overflow  mentioned  in  Appendix  V-D)  and  we  are  interested  in  the 
infinite-horizon  average  utility,  we  expect  that  what  happens  in  the  finite  part  up  to  t  will  not  contribute  to  the  average.  Since 
for  t  >  r  the  battery  never  falls  below  P,  one  of  the  loss  mechanisms  mentioned  in  Section  II-A1  is  eliminated  so  that 
performance  loss  (relative  to  an  average  policy  with  value  (1  —  S)B)  can  only  come  from  lost  recharge.  This  implies  a  DRABP 
bound  similar  to  (45),  where  the  C\  term  is  now  attributed  to  the  remaining  mechanism  of  lost  recharge.  Although  it  is  not 
used  in  the  subsequent  parts,  the  interested  reader  will  find  a  proof  of  Lemma  7  and  the  previous  intuition  in  Appendix  V-F. 

5 )  Removal  of  the  assumption  f(  (0)  <  oo:  The  only  reason  for  imposing  the  assumption  f[  (0)  <  oo  in  Section  II-A  is 
that  it  allowed  us  to  easily  provide  deterministic  bounds  for  all  queues  according  to  Lemma  3.  On  the  other  hand,  any  utility 
function  of  the  form  f(x)  =  xa  with  0  <  a  <  1  satisfies  /'( 0)  =  oo,  so  that  the  previous  analysis  must  be  modified.  This  is 
described  next. 

Specifically,  we  still  assume  monotonicity  and  concavity  for  all  /)(•)  as  well  as  /;( 0)  =  0  (since  any  other  condition  is 
unnatural)  and  f[(x)  <  oo  Va:  >  0.  However,  it  may  hold  fj (0)  =  oo  for  some  l.  For  a  given  (3  >  0,  we  propose  the  following 
policy  that  is  identical  to  DRABP  except  for  the  second  step  (selection  of  7  (t))  which  is  replaced  by 
•  For  l  such  that  fl  (0)  <  00,  select 


ll(t)=  arg  max  [V ft  (7; )  -  rjYi  ft )ji  ] 

s.t.  0  <  7  <  Ah 


(48) 


i.  e.  7 1  ft)  has  the  same  value  as  7 /(f)  in  DRABP. 
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For  l  such  that  fj  (0)  =  oo,  first  solve  the  problem 

ll  (t )  =  arg  max  [V  fi  (7/ )  -  rfYi  (t )jt } 

s.t.  (3  <  Ji  <  Ai, 


and  then  select  -q  (t)  as10 


7 f(i) 


0  ifyi(i)>nMj 

7"f  (f)  otherwise. 


(49) 


(50) 


We  denote  the  above  policy  as  /3-DRABP  to  emphasize  its  dependence  on  (3.  This  policy  is  chosen  so  that  it  approximates 
DRABP  as  (3  — >  0  and  (most  importantly)  guarantees  bounded  queues  for  Y (t),  U(t).  D(t).  In  fact,  a  repetition  of  the  proof 
of  Lemma  3  provides  the  following 

Corollary  2:  Application  of  (3-DRABP  results  in  the  following  bounds  for  all  t 


W)  <Yi  = 


^®-  +  Ai  if  f(  (0)  =  00, 

c'  /< 


V  ^(-°'>  +  Ai  otherwise 
Ui(t)  <  r,Yi  +  Ai  =  l %, 


D(t)  <  max  (UA) 


-  p  =  n 


(51) 


where  &i  =  maxse5  (Sj  0). 

Denote  with  7 i(t)  the  solution  to  (12),  so  that  yi(t)  =  7 f(t)  if  f[( 0)  <  00.  If  fj (0)  =  00,  it  holds  either  Yj(i)  <  Vfj((3)/r] 
or  Yi(t)  >  Vfj((3)/V.  In  the  former  case,  concavity  of  f)  implies  that  the  function  V  f)(x)  —  rjYi(t)x  is  non-decreasing  in  the 
interval  [0  (3\,  so  that  the  argmax  of  (12)  lies  in  the  interval  [(3  Ai ],  and  therefore  7 f(t)  =  7 i(t).  In  the  latter  case,  the 
argmax  of  (12)  lies  in  the  interval  [0  (3)  while  yf  ( t )  =  0.  Hence,  the  following  bound  holds  in  all  cases 


[Vfi{li{t))  ~  vYi(t)ji(t)\  - 
<  max  Vf(x)  =  V fi((3), 

0<x<(3 


Yfi  ~  vYi  {t)li  (t) 


<  ™*[Vfi(x)-vYi(t)x} 

0<x<{3 


(52) 


where  we  used  the  monotonicity  of  //.  We  define  T  =  {/  :  //( 0)  =  00}  so  that  the  separability  of  (12)  implies 


Li  Li 

Yi  Vh  (jl  w)  _  vYi{t)A  (t)  (7i)  -  vYi{t)  7i] 


1  =  1 


1  =  1 


v^m, 


(53) 


for  all  7  such  that  0  <  77  <  Ai.  Repeating  the  Lyapunov  drift  calculation  of  Section  II-B3  for  /3-DRABP  results  in  a  form 
identical  to  (29)  with  the  addition  of  the  term  2V  fA)  t0  the  RHS.  Hence,  the  following  extension  to  Theorem  1  is 
derived 

Theorem  2:  (3-DRABP  stabilizes  the  system,  with  the  queue  bounds  of  (51),  and  achieves  a  performance  bound  of 

hm inf  ^  f  (7  (f))  >  g*s  -  ^  fi ((3)  -  Cx  -  C2  Pr  {\^B(k)<  ^  +  ^  j  Emax  \  .  (54) 

1=1  IGF  \k=0  ) 

Since  fi  is  continuous  and  fi( 0)  =  0,  the  term  X^e:F  fi(P)  can  b e  made  arbitrarily  small  by  picking  a  small  enough  (3  (with 
a  corresponding  increase  in  queue  bounds)  so  that  asymptotic  optimality  is  retained. 

Hence,  the  assumption  fj (0)  <  00  can  be  removed  w.  1.  o.  g.  In  the  remaining  sections,  we  keep  this  assumption  to  slightly 
simplify  the  expressions  under  the  implicit  understanding  that  a  generalization  similar  to  the  above  is  possible. 


C.  Extensions  to  single-hop  and  multihop  networks 

A  careful  examination  of  the  proofs  of  the  various  statements  in  Section  II-B  reveals  that,  apart  from  algebraic  manipulations, 
the  optimality  of  DRABP  rests  on  the  following  crucial  point:  queues  Y  (t),  U  (t),  D(t)  are  deterministically  bounded  (in  fact, 
the  boundedness  of  each  queue  is  used  to  prove  the  boundedness  of  the  subsequent  ones)  and,  by  construction  of  D(t),  the 
battery  overflows  infinitely  often.  Hence,  we  expect  a  policy  that  has  the  previous  properties,  when  applied  to  a  general  network, 
to  achieve  similar  performance  to  DRABP.  This  is  studied  next. 

I0clearly,  fi  is  used  as  a  superscript,  rather  than  an  exponent,  in  this  context. 
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a)  Single-hop  networks:  Consider  a  single-hop  network  consisting  of  N  nodes,  where  each  node  n  is  allowed  to  transmit 
to  any  of  its  neighbors  belonging  to  the  set  On.  The  single-hop  constraint  implies  that  once  a  bit  is  transmitted,  it  also  exits 
the  network  layer.  Hence,  in  terms  of  the  bit  queues,  a  single-hop  network  is  just  an  aggregate  of  N  interacting  downlinks,  so 
that  most  of  the  notation  of  Sections  II- A,  II-B  can  be  modified  to  fit  the  new  model.  Specifically,  we  denote  with  En(t)  the 
battery  level  of  node  ?z  at  time  t,  and  with  Anj  (t  )  the  exogenous  arrival  process  at  node  n  destined  for  node  l  (where  /  £  On), 
while  Bn(t)  is  the  amount  of  replenished  energy  of  node  n  at  time  t.  These  processes  are  still  assumed  iid  with  mean  values 
of  A ni,  Bn  and  upper  bounded  by  Ani,Bn,  respectively.  Similarly,  Vni(t )  denotes  the  external  queue  of  node  n  where  bits 
destined  for  node  l  are  kept,  Rni,in(f)  the  number  of  bits  admitted  at  slot  t  into  node  n  destined  for  node  l  etc.  Denoting  with 
rnl(t)  the  average  rate  up  to  time  t  from  node  n  to  node  l ,  the  objective  is  to  maximize  lim  inf  (_>(*,  i  fni  )),  where 

fni  are,  per  link,  utility  functions  with  the  properties  mentioned  in  Section  II-A.  The  evolution  equations  are  essentially  the 
“vectorized”,  w.  r.  t.  the  nodes,  versions  of  the  downlink  evolutions,  i.  e. 


subject  to  constraints 


Vnl(t  +  1)  —  Vnl(t )  +  Anl(t)  —  Rnl,in(t )) 

Unl{t  +  1)  =  [Unl  (f)  -  p,nl  (S'(t),P(f))]  +  +  Rnltin{t), 

En(t  +  1)  =  min  (  En(t)  -  E  Pni{t)  +  Bn(t),  Emax  J  , 
V  l<EOn  ) 

Yni{t  +  1)  =  [' Ynl(t )  -  Rnl,in(t)]+  +  7 nl(t), 

Dn[t  +  1)  =  [ Dn(t )  —  (1  —  5)Bn(t)}+  +  Pni(t )> 

i&on 


(55) 

(56) 

(57) 

(58) 

(59) 


Rnl,in(t )  <  Vn l(t)  +  Ani(t)  Vtl,  l ,  t, 

P{t )  £  P£(t),  'Yl  Pnl(t)  <  min  Vn,  t. 

l&On 


Each  node  is  therefore  equipped  with  a  set  of  queues  connected  as  shown  in  Fig.  1.  We  denote  the  composite  queue 
X(t)  =  (U(t),Y(t),  D(t))  and  select  a  Lyapunov  function  (unless  otherwise  stated,  the  indices  n,l  will  range  over  the  sets 
{1, . . . ,  N},  On,  respectively) 

Ly(X{t))  =  YJKl{t)  +  vYZl{t))+Y,Dl(t)  <61) 

n,l  n 


The  analogue  of  Lemma  1  becomes 

Lemma  8:  No  stabilizing  policy  acting  on  a  single-hop  network  with  infinite  batteries  and  an  average  power  constraint  of 
B  can  achieve  performance  greater  than 


N 

9*avg  =max  f^nl) 

n=l  leOn 

s.t.  r  £1Z, 

0  <  r  <  A, 


(62) 


where 


R  =  Ir  :3iTp  s.t.  0  <  rni  <  Y  X]  Cnl  ( s >  P)  V  n,  l 

{  sespcTs 

Y  X!  E  Wp**  <  pn  Vn,  E  nP  =  1 
S£Sp<zvsieon  P&'Ps 

with  a  similar  definition  for  IZs  0-  e.  replace  Bn  with  (1  —  S)Bn  in  (63)).  It  still  holds  lim^o*?!  =  9*aVg ,  where  g$  is  the 
solution  to  the  optimization  of  (15),  albeit  with  the  new  definition  for  IZs- 

The  reader  can  repeat  the  steps  of  Section  II-B  and  arrive  at  a  result  similar  to  Theorem  1.  To  avoid  redundant  repetition 
and  assist  the  reader,  we  provide  only  the  salient  points  of  this  procedure.  Specifically,  the  Lyapunov  drift  under  any  policy 
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satisfies  the  following  analogue  of  (29) 


A(X(f))  -2PE 


E-^  (7 nl(t)) 


n,l 


X(t) 


<C  -  2E 


E  \Vfnl  (7nl(f))  -  r]Ynl{t)lnl{t)] 

n,l 


x{t) 


-2E 


E  ( S(t),P(t ))  -  -D„(f)P„z(t)] 


n,l 


X(t) 


-2E 


E  b?^ni(f)  -  Pni(f)]  Rnl,in(t) 
n,l 


x{t) 


(64) 


—  2(1  —  (5)  BnDn(t), 


where  C  =  NP2  +  Y^n  B2  +  Y^n,l  ((2rl  +  1  )^-nl  +  ^nl)  is  a  P°licy  independent  parameter,  with  cni  =  maxSjp  cni  ( s,p ). 
now  propose  the  following  policy,  which  is  again  a  “vectorized"  version  of  DRABP. 

Single-hop  rechargeable  adaptive  backpressure  policy  (SRABP) 

•  at  the  beginning  of  slot  t,  observe  U(t),Y (t),  V (t)  and  admit  Rin{t)  bits  into  the  network  layer  according  to 

1 1  {t )  —  ar g  max  (^7^  ni  (f )  P? %i  {t ) )  r^i 


We 


(65) 


s.t.  0  <  rni  <  min  \  Vni(t)  +  Ani(t),  Anij  \/n,l, 

select  7  (t)  as  the  solution  to  the  following  problem 

7 (t)  =  argmax  Y  W fnl{lnl)  ~  rjYnl{t)lnl\ 
n,l 

s.t.  0<7 ni<Anl  'in,  l, 

observe  channel  state  S(t)  and  select  transmission  power  P(t)  according  to 

P{t)  =  arg max  Y  Wm{t)cni  ( S(t),p )  -  Dn(t)pni] 


(66) 


(67) 


s.t.  p£VS(t),  Y  Pul  <  min  (En (: t ) ,Pj  in, 

ieOn 

Applying  the  above  policy  results  in  queues  Y(t),  U(t),  D(t)  being  deterministically  bounded  (essentially,  the  vectorized 
version  of  Lemma  3,  proved  in  an  identical  manner  as  in  Appendix  V-B,  provides  bounds  Yni,Uni,  Dn).  The  finiteness  of 
Dn(t)  implies,  with  an  argument  identical  to  the  one  in  Appendix  V-D,  that  each  battery  overflows  infinitely  often  and  hence 

At— 1 


Pr 


(En(t)  <  P,  fn  <  t'j  <  Pr  f  Y  Bn(k)  < 


(68) 


\k—0 


where  fn  =  inf{f  :  En{t )  =  Emax}  is  the  first  time  the  battery  of  node  n  hits  Emax. 

To  derive  the  analogue  of  (31)  under  SRABP,  we  define  qs,e(t)  =  Pr  {S(t)  =  s,  E(t)  =  e\ X(t))  and  denote  with  P  ( s ,  e) 

the  general  solution  to  (67).  For  the  special  case  of  E(t)  >  P  (i.  e.  when  all  batteries  are  above  P,  so  that  the  optimization 

of  (67)  is  independent  of  E(t))  we  denote  P  (s,  e)  =  pg,  so  that  it  holds 


E 


Y  \Unl{t)Cnl  ( S(t),P(t ))  -  Dn(t)Pnl(t)] 

n,l 


X(t) 


E  E  E  [Uni{t)cni  (s,  P{s,  e))  -  Dn(t)Pni  0,  e)]  qs,e(t) 

seSee£(i)  n,l 

EEE  [Urd(t)cnl  ( S,p*g )  -  Dn{t)P*nltS\  QS,e{t) 

seSee£(t)  n,l 
e>P 

=  Y  E  E  [uni{t)cni{s,p*g)  -  Dn(t)p*nltS\  qs,e{t)  ~  Y  Y  Y  [um{t)cni{s,p*s)  ~  Dn(t)Pni,s]  qs,e{t) 
seSee£(t)  n,l  seSee£(t)  n,l 

e^P 


N 


^EE  [Unl{t)cnl  (S,p*s)  -  Dn(t)p*nl  s\  Ttg  -  EE  Unl{t)c„i  {s,p*g)  Pr  (  (J  {En(t)  <  P,S(t)  =  s} 

SES  n,l  SG«S  n,l  \n=l 


X(t) 


N 


^EE  [' Unl{t)cnl  (S,p*s)  -  Dn{t)p*nl  s \  Kg  -  £  CnlUnl  Pi'  U  w*)  <  p } 


S£<S  n,l 


n,l 


y  n=  1 


X(t) 


(69) 
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where  we  used  the  fact  that  £n  i  [ Uni(f)cni  (s,  P  (s,  e))  —  Dn(t)Pni  (s,  e)]  >  0  Vs,  e  (since  p  =  0  is  always  an  allowable 
choice)  in  the  second  line  and  the  queue  bounds  on  the  last  one.  Using  the  maximization  properties  of  SRABP  in  (65)-(67) 
and  repeating  the  arguments  used  in  the  derivation  of  (32)-(37)  results  in 


A(X{t))-2VE 


^ Zfm  hnlit)) 


n,l 


x(t) 


<  c  -  2  ^2  [' Vfni(rni )  -  rjYni{t)rni]  -  2  ^  [ rjYni{t )  -  Uni(t)\  rni 


n,l 


n,l 


N 


-2  ^2^2[Unl{t)cnl  (. S,p )  -  Dn(t)pnl]  TTpTTS  +  2^C„;C/„;Pr  (  jj  {En{t)  <  P} 

S.p  n,l  n,l  \n=l 


X(f)  -2(1  -5)YJBnDn{t). 


(70) 


The  last  equation  holds  for  any  pdf  irp  and  r  s.t.  0  <  r  <  A.  Hence,  we  can  pick  r  =  r*  to  be  the  vector  achieving  g$ 
and  7 Tp  the  corresponding  pdf.  It  then  follows  from  the  definition  of  7 Zg 


^  nl  —  E  E  Cnl(s,p)TTpnS  'in,  l, 
s&spers 

E  E  E  PnlKpTTs  <  (1  -  8)Bn  in, 

sespeVg  iGOn 


(71) 


so  that  the  analogue  of  (39)  is 


A  (X(t))  —  2UE 


Y,fnl  (7 nl(t)) 

n,l 


X(t) 


(  N 

E  C  2V g$  -+■  2  ^  ^  CfiiUfii  Pr  U  {EJt)  <  P} 

n,l  \n=  1 


We  now  invoke  Lemmas  2,  4  to  get  the  following 

Theorem  3:  SRABP  stabilizes  any  single-hop  network  and  achieves  a  performance  bound  of 


liminf  V  fnl  ( r„i(t ))  >  g*s 

t — >OC  Z ' 

n,l 


c 

2V 


t-i 


N 


m  ”(limsupI  ^Pr  I  (J  {En(j) 

t — >00  \ 

j—0  \n— 1 


v 


(72) 


(73) 


Using  the  union  bound,  standard  properties  of  limsup  and  the  fact  that  each  queue  individually  overflows  infinitely  often 
(so  that  Lemma  5  is  applicable)  results  in  the  following 
Corollary  3:  SRABP  satisfies  the  following  bound 


liminf  \  fni  ( rni(t ))  >  g*s  - 

t — too  Z ' 

n,/ 


c 

2V 


'Yhn.l  Cnl  Uni 

■  E  Pr 


V 


n—1 


(k)< 


P  +  Dn  —  E„ 


(74) 


Hence,  selecting  Emax  >  P  +  max„  Dn  makes  the  probability  appearing  in  the  RHS  of  the  last  equation  zero,  so  that  SRABP 
is  asymptotically  optimal. 

1 )  Multihop  networks:  The  main  difference  between  single-hop  and  multihop  networks  is  that  in  the  latter  case  intra-node 
traffic,  in  addition  to  exogenous  arrivals,  is  explicitly  allowed.  This  suggests  that  a  vectorization  of  DRABP  to  a  multihop 
setting  will  be  more  involved.  In  fact,  new  notation  is  required  to  capture  the  fact  that  a  packet  may  need  many  hops  to 

reach  its  destination.  Specifically,  we  model  a  multihop  network  as  a  standard  digraph  (A f,C)  (with  J\f  =  {1, . . . .  A' } )  and 

use  the  commodity  concept  of  [15]  to  assume  that  each  bit  belongs  to  a  packet  with  an  associated  commodity  c  €  1C  (which 
minimally  defines  the  packet  destination11  but  may  contain  additional  information).  Hence,  we  denote  with  An'1  (t)  the  number 
of  commodity  c  bits  exogenously  generated  at  node  n  and  slot  t  (we  assume  (t)  to  be  iid  with  expectation  and 
an  upper  bound  of  A If*)  with  similar  interpretations  for  the  internal/external  queues  Un\t),  VnC\t),  respectively.  We  also 
denote  with  R^\n{t)  the  number  of  externally  admitted  commodity  c  bits  at  node  n  and  slot  t,  while  jPP (f)  is  the  number 

of  commodity  c  bits  transmitted  on  link  (a,  b)  during  slot  t.  The  total  number  of  bits  (for  all  commodities)  transmitted 

over  a  link  is  upper  bounded  by  a  vector  function  c(S(t),  P(t))  with  the  properties  mentioned  in  Section  II-A,  so  that 
it  holds  (S(t),P(t))  <  cab  (S(t),  P(t)).  We  also  define  cab  =  ma x-s,pcab  (s,p).  The  objective  is  to  maximize12 

liminft^oo  E„,c/nc  (r£  •„)>  where  f^n(f)  =  \  £r=o  E[i?^„(r)]  and  fnc{-)  are  typical  utility  functions. 


ne.  g.,  for  sensor  networks,  different  commodities  may  correspond  to  different  data  collection  points  (sinks). 
12 in  the  following,  the  indices  n,  c  will  range  over  the  sets  J\f,  /C,  respectively,  unless  otherwise  stated. 
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Motivated  by  the  downlink  and  single-hop  analysis,  we  introduce  virtual  queues  Dn{t)  and  Yn°\t)  to  handle  the  average 
power  and  (finite)  arrival  constraints,  respectively.  Hence,  the  queues’  evolution  is  as  follows 

V^c\t  +  1)  =  V^\t)  +  A^(t)  -  (75) 

1  + 

U£\t+ 1)<  +R(n%(t)  +  J2^(S(t),P(t)),  (76) 


En(t  +  1)  =  min  ^En(t)  -  ^  Pnb{t)  +  Bn(t)  5  Emax^  ) 

EE  +  1)  =  [EE)  -  R(n%(t)] +  + 

Dn(t  +  1)  =  \Dn(f)  —  (1  —  5)Bn(t)]+  +  Pnb(t), 


subject  to  constraints 

<L(0<EE)  +  EE)  V  n,  c,  t, 

E>  0,  fJ${S(t),P(t))=0  V (a,  b)  £  CSc\t,  m 

P(t)  £  Vs(t)’  y Xpnb(t)  <  min^E„(f),pj  Vn,t, 

b 

where  indices  a,  b  range  over  the  set  of  incoming  and  outgoing  neighbors  of  node  n,  respectively.  The  second  constraint  in 
(80)  models  the  fact  that  all  commodity  c  bits  may  be  required  to  be  transmitted  through  links  belonging  to  a  specific  set 
£(c)  c  C  only  (setting  £*V)  =  £  for  ajj  c  effectively  removes  this  constraint  so  that  all  commodities  can  be  routed  to  all  links). 
Finally,  the  reason  for  (76)  being  an  inequality  rather  than  an  equality  is  that  the  actual  amount  of  incoming  traffic  to  node  n 
may  be  less  than  /E  ( S(t ),  P(t))  due  to  low  queue  occupancy  of  the  neighbors. 

In  order  to  derive  an  analogue  of  Lemma  1,  we  first  introduce  the  following 

Definition  1:  For  given  node  set  AT,  commodity  set  1C  and  link  constraint  sets  C^c\  a  (consistent)  multi-commodity  flow 
{/lt>  }  a  vector  that  satisfies  the  following  conditions  for  all  a,b  £  J\f  and  c  £  1C 

f (c)  >  n 

Jab  —  U> 

(a,b)  #  =>  f$  =  0,  (81) 

f(c)  _  Ac)  _  Q 
J  aa  J  dest(c),b  ’ 

where  dest(c)  is  the  destination  node  for  commodity  c. 

In  the  subsequent  analysis,  the  term  “flow”  will  exclusively  refer  to  a  consistent  flow.  Reference  [15]  provides  the  following 
Lemma  9:  No  stabilizing  policy  acting  on  a  multihop  network  with  infinite  capacity  batteries  and  an  average  power  constraint 
of  B  can  achieve  performance  greater  than 


9avg  ~  mUX  J2fnc(rn)) 
n,c 

s.t.  r  £  1Z, 

0  <  E  <  A(nc)  V  n,  c, 


n  =  {r  = 


[r=  (E)  :  3 7Tp ,  flow  {/E  s.t.  E  <  ^  ^  Vn,c,  ^  zip  =  1  Vs, 

l  bap 

Cab(s,p)npTTS  V a,  b,  EEE  Pnb^p^s  <  Bn  Vn  j 


c  S  p  S  p  b  J 

In  accordance  with  previous  sections,  we  define  the  set  IZs  in  a  manner  similar  to  1Z  except  that  the  average  power  constraint 
B  is  replaced  by  (1  —  S)B.  We  also  define  the  multihop  analogue  of  (21)  as 


9s  =  max  ^  fnc  (E) 


s.t.  r  £  TZs, 


0  <  r  <  A, 


(84) 
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where  A  =  ( A^J  .  The  continuity  properties  of  (22)  are  still  applicable  as  well.  We  denote  the  composite  queue  X(t)  = 
(U(t),Y(t),  D{t))  and  select  a  Lyapunov  function 

Ly(X(f))  =  W([/M(i))2  +  ?7(yW(f))2l  +Y,D2n(t)-  (85) 

n,c  L  -1  n 

Squaring  the  evolution  equations  (76),  (78),  (79),  performing  some  algebra  and  rearranging  terms  yields 


A(X(f))  -2LE 


e 


■(c) 


X(f) 


<<7-2E 


-  2E 


-  2E 


E 

a,  b 


Y  Ulc\t )  -  Ulc\t )  (S(f),P(f))  -  Da(t)Pab(t) 


E  (7 nHt))  ~ VYnC)(thn\t ) 

ll 

X(f) 


X(f) 


(86) 


E  K(c)w  -  ^c)w] 


X(f) 


-2(1-5)  ^  BnD„(t). 


The  rationale  of  minimizing  overall  Lyapunov  drift  suggests  the  following  policy 
Network  rechargeable  adaptive  backpressure  policy  ( NRABP ) 


to 


•  at  the  beginning  of  slot  t,  observe  queues  17(f),  Y(t),  V(f)  and  select  Rin{t)  =  (EE(f)  j  bits  for  admission  according 

Rln (f )  =  arg  max  ^3  (VE  (*)  -  Un]  (*))  ric) 

n,c 

s.t.  0  <  <  min  (Vn(c)  ( t )  +  (f) ,  A^)  V  n,  c, 


(87) 


select  7(f)  according  to 


7(f)  =  arg  max  ^3  (E0)  -  Win  ) 

n,c 

s.t.  0  <  7^c)  <  A^  V  n,  c, 


(88) 


•  observe  S'(f)  and  select  P(f),  i-i^  (f)  so  as  to 


E 


maximize 


ab 


(89) 


YWab\t)^ab  -  Da{t)Pab 
c 

sX-  PerS(ty  Y  Pnb  -  min  (E (*) >  -P )  v  n, 

b 

Y^ab  <c°.b(S{t),P(t))  V a,  b, 

C 

v$=0  V(a,6)^£(c), 

where  (f )  =  Ua  '1  (f )  —  uA'1  (f )  will  be  referred  to  as  the  differential  backlog  between  a ,  b. 

The  following  observations  can  be  made.  The  problems  in  (87),  (88)  are  separable  and  can  be  solved  distributively.  It  is 
also  easy  to  show,  using  a  greedy  exchange  argument,  that  the  solution  to  (89)  is  equivalent  to 


where 


Pit)  =  arg  max  53  [wab(t)cab  (S(f),p)  -  Da(t)pab 

ab 

•t.  pePS(t)1  Y,Pnb  <  min  ( En(t ),  p)  V 

b 

Wab(t)=  max  (7ic)(f)-7b(c)(f)}, 

c:(a,6)££(c)  ^  J 


(90) 


(91) 


so  that,  for  each  time  slot,  we  need  only  select  a  single  commodity  <bW  =  argmaxc:(a  6)e£(c){17ic)(f)  -  Pb(c)(f)}  per  link 
(obviously,  different  links  may  carry  different  commodities).  A  casual  glance  at  (90)  also  reveals  that  the  optimal  solution  has 
the  property  that  Pab(f)  =  0  if  Wab{t)  <  0.  Hence,  with  no  loss  of  optimality,  we  define 

i  + 


W*ab{t)  = 


max 

c:(a,b)e£(c) 


Ht)-ut 


(c) 


(92) 
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and  use  this  value  instead  of  Wab(t)  in  the  subsequent  expressions.  Hence,  the  following  result  is  true 
Lemma  10:  Application  of  NRABP  results  in  the  following  conditions  being  satisfied 

•  for  each  c  G  1C,  there  exists  a  sufficiently  large  F ^  such  that  R^f\n{t)  =  0  whenever  Un\t )  >  F^c\ 

•  for  any  link  ( a,b )  such  that  W^b\t)  <  0  it  holds  l-f  b  (t)  =  0.  As  a  result,  the  implication  W*b(t)  =  0  =>■  Pab(f)  —  0  is 
true. 

Proof:  The  first  condition  follows  from  the  bang-bang  property  of  the  solution  to  (87)  and  the  fact  that  all  queues  Yn°\t) 
are  stable  under  NRABP  (the  latter  is  proved  by  a  vectorization  of  the  proof  in  Appendix  V-B),  while  the  second  one  has 
already  been  proved.  ■ 

Under  the  previous  observations,  (86)  becomes 

A(X(f))-2VE  c(jic)(t))  X(t)  <C-  2E  E  [vfnc  (7<c)  (t ))  -  VY^  (f  )7^  (t)]  X(t) 

n,c  J  L  nJc 

-2E  ^  [w:,b{f)cab  (5(f),  P(t))  -  Da(t)pab]  X(t)  -2E  ^  [^(f)  -  f/^(f)]  X(t)  (93> 

a,b  L  n,c 

-2{l-8)YJBnDn{t). 

n 

The  main  stability  result,  proved  in  Appendix  V-G,  is  now  stated 

Lemma  11:  NRABP  stabilizes  all  queues  according  to  the  following  bounds 

uic\f)  <  u<f\  (94) 

Dn{t)  <  UnCn  +  Pn  =  Dn, 

where  Un  =  maxc  Un'1  and  Cn  =  max^  'ffb  (s)  0). 

As  before,  we  denote  with  qs.eft)  =  Pr(5(f)  =  s,  22(f)  =  e\  X(t))  the  respective  probability  under  NRABP  and  with 
P  (s,  e)  the  solution  to  (90)  when  5(f)  =  s  and  Eft)  =  e  (when  e  >  P,  the  solution  is  independent  of  e  and  is  denoted  as 
p*s).  The  analogue  of  (31)  is 

E  E  [^a6(f)ca6  (5(f), P(f))  -  Da(t)Pab(t)\  X(t)  =E  E  Y,iWab(tyab(s,P(s,e))-Da(t)Pab(s,e))qs,e(t) 

a,b  S£S  €Z(z£(t)  a,b 

EEE  [W*b(t)cab  ( s,p*s )  -  Da{t)p*ab,s\  Qs,e{t) 

seSeef(t)  a,b 


-EE  E  [Wab(.t)Cab(s,P*s)  -Pa(fKM]  QS,e(t)  ~  E  E  ^[Wab{t)Cab{s,p%)-Da{t)p*abs\qS.e{t) 
sesee£(t)  a,b  sesee£(t)  a,b 

e^P 

>EE  [W«6(*)c«6  (s,P*s)  -  Da{f)p\s\  Ks  -  (e^I  Pr  f  U  <  P }  W )  - 

SG«S  a,b  \  a,b  J  \n=l  / 


where  the  usual  manipulations  have  been  performed.13 

We  now  follow  a  procedure  similar  to  the  one  that  produced  (33),  (36),  (37).  Specifically,  the  maximizing  properties  of 
NRABP  yield  for  all  r  s.t.  0  <  <  A 


E  [Vfnc  (7 -  VY^(t)ri\t)]  X(t)  >  E  [Vfnc  (»£>)  -  ^  (t)r^ 

n,c  J  n,c 

E  (vYjfHt)  -  Uic\t))  R$n(t)  X(t)  >  E  ( vY^c\t )  -  uic\f))  ric>. 


13 specifically,  the  transition  from  the  third  to  the  fourth  line  of  (95)  relied  on  the  fact  that  W*.  (t)  =  —  UiC“b^\t)  <  Ua ■ 
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The  solution  to  (89)  when  En(t)  >  P  for  all  n  also  implies 

E  iWab(t)Cab  (s,p*s)  -  Da(t)p*b  g\  >  Y  [W*b(t)cab  ( S,p )  -  Da(t)pab\ 


a,b 


a,b 


>E 


E  wab \pTab  (S’P)  -  Da{t)Pab 


(98) 


Vs  €  S,  M p  e  Vs, 


where  the  second  inequality  follows  from  the  facts  W*b(t)  >  W^(t)  for  all  c  and  (s,p)  <  cab(s,p).  Multiplying 

the  last  inequality  by  TipTTs  (where  ~p  is  an  arbitrary  pdf)  and  summing  over  ,s.  p  produces 


E  E  [ wab(t)cab  («,ps)  -  Da[t)P*abs]  i ts>y  EE 

ses  a,b  SGSpeVg  a,b 

We  now  insert  (99),  (97),  (96),  (95)  into  (93)  and  perform  some  algebra  to  get 


E  Wab \t)Pab  (S’P)  -  Da{t)pab 


KpTTS-  (99) 


A  (X(t))  -  2VE 


-2E  E  E 


E  f™  (vic) 


X(t) 


<C-2VY/  Vfn  c  (r[c>)  +  2  Y,  U<c)(t)rV>  -  2(1  -  6)  Y  BnDn{t) 


SeSpeVs  a,b  L  C 


^bWVab  (S’P)  -  Da(t)pab 


N 


KpKs  +  2  X  UaCab  1  P^  U  w*)  <  p } 


a,b 


\  n— 1 


(100) 


X(f)  . 


Expanding  (t)  into  Ua°'> (t)  —  Ubc'>  (t)  and  performing  a  change  of  indices  so  that  all  backlogs  appear  with  index  n  in  the 
above  relation  yields  for  all  r  with  0  <  r  <  A 


LHS  of 


(100)  <  C  -  2V  Y  fnc  (Ec))  +  2  Y  (: t ) 


r(«0 


-EE  Pnb  (s,p)nf,TTS  +  EE  Pab  (s,P)7Tp7rS 


S,p  b 


2  YD”V) 


E  E  PnbKpTTS  -  (1  ~S)Bn 

S,p  b 


N 


+  2  £  Ua^ab  1  U  w*)  < 


X(f)  . 


v  n=l 


(101) 


Selecting  r  as  the  solution  to  (84)  and  Tip  as  the  corresponding  pdf  in  the  definition  of  7 Zg  creates  many  cancellations  in  (101) 
and  finally  produces 


JV 


LHS  of  (100)  <C-  2  Vg*5  +  2  PT  Uacab  Pr  |J  {En(t)  <  P} 


\  n= 1 


X(t)  . 


(102) 


As  in  previous  sections,  we  invoke  Lemmas  2,  5  (since  each  individual  queue  En(t)  overflows  infinitely  often)  to  get  the 
final 

Theorem  4:  NRABP  stabilizes  any  multihop  network  and  achieves  a  performance  bound  of 


lim  inf 

t—>  OO 


E  (r"n  }  (*))  >  9*s  -  E  -  1  (  E  UaCab  ]  E  Pr  (  E  B ^  ^  ~  ~  ^  ^  I  ’  (103) 


(  <7—1 


P  +  Dn  —  En 


a,b 


n— 1  \k—0 


where  a  =  \Emax/P  -  1], 


III.  The  Hidden  cost  of  hidden  terminals 

A.  Where  is  Hidden  Terminal? 

The  hidden  terminal  problem  is  closely  related  with  the  Transmission  Range  (TX_Range),  Carrier  Sensing  Range  (CS_Range) 
and  Interference  Range  (IF_Range)  of  stations  in  WLAN.  The  followings  are  the  widely  adopted  definitions  for  these  three 
ranges  [25]  [26],  The  TX_Range  is  the  range  (with  respect  to  the  transmitting  station)  within  which  a  transmitted  frame  can  be 
successfully  received.  The  CS_Range  is  the  range  (with  respect  to  the  transmitting  station)  within  which  the  other  stations  detect 
a  transmission.  The  IF_Range  is  the  range  within  which  stations  in  receive  mode  will  be  “interfered  with”  by  a  transmitter, 
and  thus  suffer  a  loss. 

The  measurement  study  by  [25]  shows  the  following  relationship  for  802.1  lb  networks:  TX_Range  <  IF_Range  <  CS_Range. 
In  addition,  it  shows  that  CS_Range  is  about  1.5  times  of  the  TX_Range.  We  have  also  measured  the  three  ranges  in  802.1  lg 
networks  and  got  similar  results  [27],  From  our  measurements,  CS_Range  is  between  1.2xTX_Range  and  1.6xTX_Range, 
depending  on  the  environment.  Although  the  exact  values  of  these  two  measurement  results  are  different,  their  ratios  between 
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Fig.  2.  Illustration  of  the  hidden  terminal  problem  in  ad  hoc  networks.  For  transmitter  T,  stations  in  in  the  shaded  area  are  hidden  terminals. 


CS_Range  and  TX_Range  are  within  the  same  range  (1.2,  1.6).  The  difference  is  mainly  due  to  different  wireless  cards  used 
for  the  measurements  and  different  measuring  environments. 

As  illustrated  by  Fig.  2,  nodes  within  the  interference  range  of  a  receiver  and  out  of  the  carrier  sensing  range  of  the  transmitter 
are  usually  called  hidden  terminals  [26].  In  a  single-cell  WLAN,  all  nodes  are  within  the  TX_Range  of  the  AR  so  hidden 
terminals  are  those  within  the  TX_Range  of  the  AP  and  out  of  the  CS_Range  of  the  transmitter,  as  illustrated  by  Fig.  3a.  As 
we  can  see  from  Fig.  3a,  to  eliminate  hidden  nodes,  CS_Range  must  be  no  less  than  2xTX_Range  to  make  sure  that  a  node 
at  the  edge  of  a  WLAN  can  sense  all  of  the  other  nodes.  However,  both  the  measurement  results  from  [25]  and  from  us  [27] 
show  that  CS_Range  is  less  than  2xTX_Range  for  many  existing  802.11  wireless  cards.  This  is  why  hidden  nodes  exist  in 
current  802.11  networks. 

The  performance  degradation  due  to  hidden  terminals  is  best  shown  in  the  following  example.  In  Figure  3a,  a  source  station 
is  trying  to  transmit  to  the  access  point  (AP).  It  starts  the  transmission  with  a  RTS/CTS  handshake:  The  source  station  sends  a 
RTS  and  the  AP  replies  with  a  CTS.  If  the  handshake  is  successful,  all  the  nodes  in  the  WLAN  will  be  aware  of  the  subsequent 
transmission  and  keep  silent  during  the  transmission  period.  Note  that  RTS  signal  cannot  be  sensed  by  the  hidden  nodes,  who 
are  beyond  the  CS_Range  of  the  source  station.  So  during  the  transmission  of  RTS,  the  hidden  nodes  may  initiate  their  own 
transmissions  if  their  backoff  counters  reach  zero.  In  the  802.11  standard,  the  duration  of  RTS  transmission  is  much  longer 
than  that  of  a  backoff  slot.  For  instance.  With  a  typical  setting  of  802.1  lg  [28],  a  RTS  takes  about  59  /. is  while  a  backoff  slot 
duration  is  9  gs.  This  implies  that  each  hidden  terminal  has  at  least  six  chances  to  initiate  their  own  transmissions  during  a 
RTS  transmission,  and  collide  with  this  RTS  at  the  access  point.  The  collisions  waste  channel  time  and  thus  degrade  network 
performance.  In  this  example,  we  assumed  that  RTS/CTS  mechanism  is  active.  If  not,  the  hidden  terminals  will  have  more 
chances  to  initiate  their  own  transmissions  during  the  ongoing  data  packet  transmission,  which  is  much  longer  than  the  RTS 
transmission.  Then  the  network  performance  will  be  degraded  more  severely. 

Moreover,  this  performance  degradation  is  not  uniform  for  all  nodes  in  a  WLAN.  Instead,  it  depends  on  the  locations  of  the 
nodes.  Assuming  stations  are  randomly  located  in  the  network.  Figure  3  shows  that  a  station  sees  more  hidden  nodes  when  it 
is  far  away  from  the  AP  (Figure  3a),  compared  with  the  case  when  it  is  close  to  the  AP  (Figure  3b). 

Various  types  of  areas  used  in  this  paper  are  derived  in  the  Appendix,  which  analytically  shows  the  relationship  between 
the  total  hidden  area  and  the  distance  from  the  transmitter  to  the  AP  (Eq.  183).  The  relationship  is  plotted  in  Fig.  4,  where 
the  area  is  normalized  by  the  whole  network  area  and  the  distance  is  normalized  with  respect  to  TX_Range.  As  we  can  see, 
the  hidden  area  grows  almost  linearly  as  the  distance  increases. 

B.  An  Analytical  Model  for  the  Hidden  Terminal  Problem 

1)  System  Model:  We  consider  an  802.11  single-cell  WLAN  with  only  uplink  traffic,  wherein  all  transmissions  are  initiated 
by  stations  and  destined  to  access  point  (AP).  The  RTS/CTS  mechanism  is  assumed  to  be  enabled.  The  AP  is  located  at  the 
center  of  the  network  and  the  other  stations  are  randomly  located  within  the  AP’s  transmission  range.  Besides,  we  set 

CS_Range  =  g  x  TX_Range,  g  £  (1,2).  (104) 

Note  that  this  range  of  g  well  reflects  the  observations  in  independent  WiFi  measurements  [25] [27],  Other  assumptions  are 
listed  below. 

1)  No  capture  effect,  i.e.,  when  two  or  more  packets  collide  with  each  other,  all  of  them  will  fail. 

2)  No  channel  errors. 

3)  All  transmissions  are  at  the  same  data  rate. 

4)  All  stations  are  always  in  backlog  state. 

As  in  [29],  the  conception  of  “time  slot”  is  extended  to  refer  to  any  continuous  time  period  that  a  station  observes.  It  is  no 
longer  only  the  backoff  time  slot  in  802.11;  instead,  the  duration  of  a  time  slot  in  our  model  can  be  one  the  followings: 

•  a:  the  duration  of  a  successful  transmission. 
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(a) 


Fig.  3.  Illustration  of  the  hidden  terminal  problem  in  infrastructure  mode  802.11  networks.  Note  that  the  hidden  area  in  (a)  is  larger  than  that  in  (b)  (where 
the  source  station  is  closer  to  the  AP). 


Fig.  4.  t  increases  almost  linearly  with  the  distance  to  the  AP. 


•  j3:  the  duration  of  a  collision  period. 

•  6:  the  duration  of  a  backoff  time  slot. 


2)  Network  Analysis:  In  [29],  Bianchi  models  the  802.11  DCF  with  a  two-dimensional  Markov  chain.  For  a  given  collision 
probability  Pc,  the  model  predicts  the  transmission  probability  r: 


2(1  -  2 Pc) 

(1  -  2 PC){W  +  1)  +  PCW[  1  -  (2 Pc)m] 


005) 


where  W  is  the  initial  backoff  window  size  and  m  is  the  maximum  number  of  backoff  stages.  On  the  other  hand,  the  network 
conditions  for  a  transmitted  packet  to  collide  is  that,  in  a  time  slot,  at  least  one  of  the  remaining  stations  transmits: 


Pc=  1  -  (1  -  T )N~X 


(106) 


where  N  is  the  number  of  nodes  in  the  network.  By  solving  these  two  equations  numerically,  Bianchi  obtains  r  and  Pc,  based 
on  both  of  which  he  then  computes  the  network  throughput. 

We  will  follow  the  same  approach  in  our  modeling.  The  main  difference  between  Bianchi’s  analysis  [29]  and  ours  is  that 
we  consider  hidden  terminals.  To  capture  the  location-dependent  nature  of  hidden  terminal’s  impacts  (as  described  in  Section 
III-A),  we  slot  the  whole  network  area  into  M  evenly  spaced  concentric  annuluses  centered  at  the  AP,  as  shown  in  Fig.  5. 
Let  r  be  the  TX_Range  of  a  station.  Each  annulus  A(i)  (i  ^  2)  is  centered  at  AP  and  confined  by  two  circles  with  radius 


23 


Fig.  5.  M  evenly  spaced  concentric  annuluses  centered  at  the  AP  (M  =  4  in  this  figure). 


Fig.  6.  Illustration  of  Hidden  Area  H(i,j)  and  Covered  Area  E(i,j). 


r{i  —  1  )/M  and  ri/M  respectively;  while  A{  1)  is  the  interior  of  the  innermost  circle  centered  at  AP  and  with  radius  r/M. 
As  we  can  see  from  Fig.  5,  stations  in  annulus  A(i)  are  at  approximately  the  same  distance  <l(i)  from  the  AP,  so  they  observe 
approximately  the  same  number  of  hidden  nodes.  Therefore,  they  experience  approximately  the  same  collision  probability 
Pc(i),  which  results  in  approximately  the  same  transmission  probability  t(i)  (from  Eq.  106).  Of  course,  the  larger  M  is,  the 
better  the  approximations  are.  In  fact,  as  we  will  show  later,  the  approximations  here  are  very  accurate  even  when  M  is  as 
small  as  4. 

As  in  [29],  we  adopt  the  key  assumption  that  each  station  independently  makes  a  transmission  attempt  at  any  time  slot  with 
a  constant  probability  r.  However,  there  is  a  difference  on  the  meaning  of  r.  In  [29],  r  is  equal  for  all  nodes,  since  the  model 
assumes  no  hidden  nodes.  But  in  our  model,  r  depends  on  the  distance  between  a  node  and  the  AP. 

Following  the  same  way  that  Bianchi  derives  Eq.  105  in  [29],  we  obtain  the  relationship  between  t(i)  and  Pc{i)  for  a  node 
in  annulus  A{i)  (A  brief  derivation  is  presented  in  the  Appendix): 


_ 2(1  -  2Pc{i)) _ 

(1  -  2 Pc(i)){W  +  1)  +  Pc(i)W[  1  -  (2 Pc{i))m] 

(*  =  1,2, ...,M) 


(107) 


We  now  derive  the  network  conditions  for  a  transmitted  packet  to  collide  in  the  presence  of  hidden  terminals.  Let  us  first 
define  the  hidden  area  H{i,j)  and  the  covered  area  E(i,j ),  with  i,j  £  [1,M],  For  a  source  station  ( Src )  located  in  annulus 
A(i),  H(i,j)  is  defined  as  the  part  of  annulus  A(j)  that  is  out  of  the  CS_Range  of  Src;  while  E(i,j )  is  defined  as  the  part 
of  annulus  A(j)  that  is  within  the  CS_Range  of  Src.  Both  of  them  are  normalized  with  respect  to  the  total  area  within  the 
AP’s  TX_Range.  Fig.  6  shows  the  H(i,j)  and  E(i,  j)  for  a  source  station  located  in  A(i).  Note  that  fT(4, 1)  is  zero  in  this 
example  and  not  shown  in  the  figure.  The  computations  of  the  areas  H(i,j)  and  E(i,  j)  can  be  found  in  Appendix. 

We  can  then  determine  the  expected  number  of  nodes  in  hidden  area  H(i,j)  and  covered  area  E(i,j).  Let  the  total  number 
of  stations  in  the  network  be  N,  then  a  station  in  annulus  A(i)  sees  an  average  NH(i,j)  hidden  nodes  and  NE(i,j)  covered 
nodes  in  annulus  A  (j ) . 

Let  us  consider  the  conditions  for  a  successful  transmission.  The  transmission  procedure  is  a  RTS -CTS -DATA- ACK  four-way 
handshake.  Since  Src  senses  the  channel  before  the  transmission  of  RTS,  we  are  sure  that  all  nodes  within  Src’ s  CS_Range 
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Fig.  7.  Aggregate  Network  Throughput:  Model  VS.  Simulation. 


are  idle  in  the  time  slot  just  before  the  RTS  transmission.  However,  Src  has  no  knowledge  about  the  status  of  the  hidden 
nodes.  Denote  p  as  the  number  of  backoff  time  slots  that  a  RTS  transmission  spans,  that  is, 

tRTS  =  P$  (108) 

where  tuTS  is  RTS  transmission  time. 

An  obvious  condition  is  that  no  nodes  within  Src’s  CS_Range  initiates  transmissions  in  the  same  time  slot  as  Src  does. 
Since  Src’ s  RTS  signal  cannot  be  sensed  by  its  hidden  nodes,  another  condition  is  that  no  hidden  nodes  transmit  during  the 
RTS  transmission.  Likewise,  when  Src  initiates  a  transmission,  a  hidden  node  may  already  be  transmitting  a  RTS,  since  Src 
cannot  sense  the  RTS  signal  and  is  not  aware  of  the  transmission.  So  the  third  condition  is  that  no  hidden  nodes  transmits  in 
the  preceding  t rts  time.  If  all  of  the  three  conditions  are  satisfied,  the  RTS  will  be  successfully  received  by  the  AP,  which 
then  replies  with  a  CTS. 

Irrespective  of  whether  they  have  received  the  RTS  or  not,  all  the  nodes  in  the  source  station’s  CS_Range  will  not  transmit 
within  a  Distributed  Interframe  Space  (DIFS)  time.  So  they  will  not  interfere  with  the  CTS  which  is  sent  a  Short  Interframe 
Space  (SIFS)  time  after  the  completion  of  the  RTS,  because  a  SIFS  is  shorter  than  DIFS.  So  only  hidden  nodes  might  interfere 
with  the  CTS.  Therefore,  the  condition  for  the  CTS  to  be  successful  is  that  no  hidden  node  launches  transmissions  at  the 
same  time  slot  as  the  AP  transmits  a  CTS.  Since  all  nodes  in  the  network  are  within  the  AP’s  TX_Range,  they  will  find  the 
channel  busy  during  the  CTS  transmission  and  not  transmit.  Once  the  RTS/CTS  handshake  is  successful,  the  transmission 
is  guaranteed  to  be  successful,  since  all  stations  will  set  their  Network  Allocation  Vector  (NAV),  which  keeps  them  silent 
during  the  transmission.  Consequently,  the  conditional  success  probability  Ps(i)  given  that  a  node  in  annulus  A(i)  initiates  a 
transmission  is 

M  M 

pa(i)  =  n[i-^r(^fl[i-^r(a-i)^ 

.7=1  3=1 

M 

=  []_  _  T^j'jjN[E(iJ)+02P—l)H(.i,3) ]  (109) 

.7=1 

Then  the  conditional  collision  probability  Pc{i)  given  that  a  node  in  annulus  A(i)  initiates  a  transmissions  is 

M 

pc(i)  =  i  -  n  [i  ~  (no) 

3=1 

With  (107)  and  (110),  we  have  2 M  equations  and  2M  unknowns.  Numerically  solving  the  equations,  we  obtain  r(z)  and 
Pc(i )  for  i  from  1  to  M. 

C.  Throughput 

Normalizing  the  area  of  annulus  A(i)  with  respect  to  the  network  area,  we  get 

A(i)  =  [i2  -  (i  -  1)2]/M2,  *  =  1,2,..., M.  (Ill) 

Since  we  assume  that  nodes  are  randomly  located  in  the  network,  the  expected  number  of  stations  located  in  annulus  A(i)  is 
proportional  to  the  normalized  area  of  annulus  A(i): 

N(i)  =  NA(i)  =  N[i2  -  (i  -  1  )2]/M2,  i  =  1, 2, . . . ,  M. 


(112) 
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The  probability  that  all  stations  are  idle  in  a  time  slot  is 

M 

Pidle  =  (113> 

i= 1 

As  the  transmission  probability  r(i)  is  very  small  and  independent  for  each  node,  the  probability  that  there  is  a  successful 
transmission  in  a  time  slot  can  be  approximately  expressed  as 

M 

Psuccess  =  ^2N(i)T(t)(l-Pc{i))  (114) 

i= 1 

The  three  different  channel  states  in  a  time  slot  are:  Successful  Transmission,  Collision  and  Idle.  So  the  probability  that  there 
is  a  collision  in  a  time  slot  is 


P collision  —  1  Pi 


success 


-P 


idle 


(115) 


Since  most  collisions  occurs  when  a  node  launches  RTS  during  the  transmission  of  a  RTS  by  another  node,  the  collision  period 
is  just  the  overlapping  of  these  two  RTS’s.  So  (3  approximates  twice  of  l  rts- 

Let  S  be  the  expected  aggregate  throughput  for  the  whole  network,  which  can  be  computed  in  this  way 

S  =  PsuccessE[Payload\/T  (116) 

where  E[Payload]  is  the  expected  payload  length  of  a  packet  and  T  is  mean  duration  of  a  time  slot 

T  =  Pidled  +  Psuccess Q  +  PcollisionP-  (H7) 

The  per-node  throughput  for  a  node  in  annulus  A(i)  is 

S(i)  =  r(i)(l  —  Pc(i))E[Payload\/T  (118) 


D.  Simulation 

To  validate  the  model,  we  have  run  extensive  simulations  with  the  Network  Simulator  ns-2  v2.33  [30], 

1)  Simulation  Settings:  The  system  parameters  for  the  simulations  and  for  obtaining  numerical  solutions  from  the  model 
have  been  set  to  values  typical  in  802.1  lg  WLAN,  as  summarized  in  Table  I.  The  capture  effect  has  been  turned  off  and  the 
channel  is  free  of  errors.  The  AP  is  located  at  the  center  of  the  network,  while  the  other  stations  are  uniformly  located  within 

TABLE  I 

Parameters  Used  in  Both  Simulations  and  Model 


Preamble  Length 

20  ns 

Slot  Time 

9  fis 

PLCP  Header  Length 

4  ns 

SIFS 

10  /is 

Max  Propagation  Delay 

o.5  ns 

DIFS 

28  /xs 

W  (CWMin) 

31 

m  (CWMax) 

1023 

the  TX_Range  of  of  the  AP.  The  AP  does  not  initiate  transmissions,  while  the  other  stations  transmit  UDP  packets  to  the  AP 
at  6  Mbps.  From  our  previous  experimental  measurements  for  802.1  lg  WLAN,  CS_Range  is  between  1.2  TX_Range  and  1.6 
TX_Range  [27].  We  also  note  that  many  existing  papers  assume  CS_Range=TX_Range,  which  is  not  realistic.  Taking  the 
measurement  results  and  the  prevailing  assumption  into  consideration,  we  decided  to  run  simulations  and  obtain  numerical 
values  from  the  model  with  CS_Range  set  to  1.0  TX_Range,  1.3  TX_Range  and  1.6  TX_Range  respectively. 

2 )  Performance  of  Saturated  Networks:  In  this  subsection,  we  present  the  results  from  both  analysis  and  simulations  for 
saturated  networks.  There  are  16  uniformly-located  nodes  in  the  network.  All  nodes  are  at  full  load  and  the  payload  size  has 
been  fixed  to  1500  bytes.  The  RTS/CST  handshake  has  been  turned  on. 

Fig.  7  compares  the  aggregate  network  throughput  obtained  from  the  analytical  model  and  simulations.  As  shown  in  this 
figure,  the  model  is  very  accurate,  since  the  average  difference  between  the  model  and  simulation  results  is  around  4%. 
Furthermore,  this  accuracy  is  observed  for  networks  ranging  from  4  nodes  up  to  as  many  as  64  nodes.  Besides,  we  note  that 
the  aggregate  throughput  does  not  change  much  with  y. 

Fig.  8  and  Fig.  9  show  the  per-node  throughput,  collision  probability  and  delay  as  a  function  of  the  distance  from  a  node  to 
the  AP.  The  distance  has  been  normalized  with  respect  to  the  TX_Range.  From  Fig.  8,  we  can  see  that  the  per-node  throughput 
from  our  model  matches  its  counterpart  from  simulations  well.  In  addition,  as  we  can  see  from  Fig.  8  and  9,  both  the  analytical 
model  and  simulations  reveal  a  significant  unfairness  in  terms  of  both  throughput  and  delay  for  nodes  at  different  locations. 
For  t]=  1 .0,  1.3  and  1.6,  the  throughput  for  the  closest  node  to  the  AP  is  eight,  five,  and  three  times  as  much  as  that  for  a  node 
at  the  edge  of  TX_Range,  respectively.  The  scenario  y  =  1.0,  which  is  not  realistic,  is  only  for  comparison  with  the  literature. 
So  in  practice,  the  throughput  ratio  between  the  closest  and  the  furthest  node  ranges  from  three  to  five.  It  is  also  clear  that  the 
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Throughput  Per  Node  VS.  Distance 


Fig.  8.  Per-node  Throughput:  Model  VS.  Simulation. 


Collision  Prob.  (from  Model) 


Delay  (from  ns-2) 


Fig.  9.  Collision  Probability  and  Packet  Delay. 


Normalized  Distance 


Fig.  10.  ns-2  simulation  results  for  a  saturated  16-node  network  with  RTS/CTS  turned  on  or  off  and  various  payload  length. 


Fig.  11.  ns-2  simulation  results  of  VoIP  service. 
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unfairness  becomes  less  significant  as  77  increases.  Please  note  that  we  have  assumed  no  channel  errors  in  the  model  and  set 
the  simulator  free  of  channel  errors,  therefore,  this  unfairness  has  nothing  to  do  with  the  path  loss  which  causes  more  packet 
loss  for  stations  far  from  the  AP  than  for  stations  close  to  the  AP 

We  have  also  run  simulations  with  different  network  sizes  (i.e.,  the  number  of  nodes  in  the  network),  and  obtained  similar 
results  as  Fig.  8  and  9.  Due  to  space  limitation,  the  results  are  not  shown  in  this  paper. 

3)  Simulation  Results:  Networks  Without  RTS/CTS:  All  the  previous  sections  are  about  networks  with  RTS/CTS  handshake. 
We  note  that  RTS/CTS  mechanism  is  disabled  in  most  deployed  802.11  WLANs.  To  investigate  the  hidden  terminal  problem 
in  this  case,  we  have  run  simulations  with  RTS/CTS  disabled. 

We  have  considered  three  scenarios  with  packet  payload  length  set  to  125,  500  and  1500  Bytes.  The  other  simulation  settings 
are  exactly  the  same  as  those  in  section  III-D2. 

The  per-node  throughput  are  shown  in  Fig.  10.  As  we  can  see,  with  //  =  1.3  and  a  1500-Byte  payload  length,  the  throughput 
ratio  between  the  closest  node  and  the  furthest  node  is  760  with  RTS/CTS  disabled  and  5  with  RTS/CTS  enabled.  This  shows 
that  the  RTS/CTS  mechanism  does  alleviate  the  performance  unfairness  due  to  hidden  terminals,  although  it  cannot  completely 
solve  the  problem. 

E.  Impact  on  Real-Time  Service 

To  evaluate  the  the  impacts  of  the  performance  unfairness  to  real-time  services,  we  have  done  simulations  with  Voice  over 
IP  (VoIP)  services  in  ns-2.  We  have  adopted  the  ns-2  VoIP  framework  established  by  [31]  and  [32], 

The  VoIP  framework  evaluates  the  Quality  of  Service  (QoS)  of  VoIP  sessions  in  terms  of  Mean  Opinion  Score  (MOS)  [33]. 
The  higher  the  value  of  MOS,  the  better  the  QoS  is.  The  quality  of  VoIP  service  is  considered  to  be  good  if  the  MOS  is  above 
4  [34], 

In  the  simulations,  each  node  initiates  two  VoIP  sessions  and  talks  to  the  AP.  All  the  other  settings  are  the  same  as  those 
in  section  III-D2. 

Fig.  11  shows  the  QoS  of  the  VoIP  services  for  nodes  at  different  locations.  As  we  can  see,  the  QoS  does  not  depend  on 
location  when  there  are  no  hidden  nodes  (77  =  2.0);  however,  it  greatly  depends  on  location  in  the  presence  of  hidden  nodes 
(?7  =  1.0, 1.3, 1.6).  Besides,  the  unfairness  becomes  less  significant  with  a  greater  77.  Note  that  when  there  are  no  hidden 
nodes  (77  =  2.0),  the  VoIP  sessions  for  all  nodes  are  with  good  quality  (MOS>4).  This  indicates  that  the  network  is  non- 
saturated.  Therefore,  hidden  terminals  cause  significant  performance  unfairness  not  only  under  saturated  load,  but  also  under 
non-saturated  load. 


F.  Experimental  Measurements 

We  have  set  up  a  testbed  to  measure  the  throughput  and  packet  loss  rate  in  a  WLAN  with  hidden  nodes  and  single  access 
point,  houses  and  WiFi  hot  spots,  which  keeps  the  experiments  free  from  external  interference. 

We  have  used  five  laptop  computers  (A,  B,  C,  D  and  E)  running  Linux  Fedora  5  (kernel:  2.6.16-prep).  Each  laptop  is 
equipped  with  an  Atheros  802.1  lg  wireless  card  (chipset:  Atheros  AR5212)  and  uses  the  MadWifi  driver  [35]  (version  0.9.4). 
To  facilitate  experimentation,  the  transmission  power  of  each  wireless  card  is  fixed  to  1  dBm  (1.26  mW),  which  is  greater  than 
the  minimum  transmission  power  (1  mW)  required  by  the  802.11  standard. 

The  topology  of  our  experiments  is  shown  and  Fig.  12.  All  laptops  are  in  a  line,  with  laptop  C  in  the  middle.  Laptop  C  acts 
as  the  AP  and  does  not  initiate  transmissions;  while  the  other  laptops  are  at  full  load  and  transmit  UDP  packets  to  C  with  a 
fixed  payload  length  of  1470  Bytes. 

We  first  measure  the  TX_Range  with  a  method  which  is  similar  as  the  one  used  by  [25]  and  [36].  In  our  measurements, 
TX_Range  is  defined  as  the  largest  transmission  distance  when  the  packet  loss  rate  is  maintained  below  10%.  Let  r  denote 
TX_Range,  then  CS_Range  is  between  1.2r  and  1.6r  [27],  As  observed  from  the  topology  in  Fig.  12,  nodes  B  and  D  which 
are  close  to  the  access  point  see  one  hidden  node,  while  nodes  A  and  E  which  are  far  away  from  the  access  point  see  two 
hidden  nodes.  For  example,  E  is  1.6r  away  from  B,  so  B  sees  E  as  a  hidden  node;  both  nodes  D  and  E  are  more  than  1.6r 
away  from  A,  so  A  sees  both  of  them  as  hidden  nodes. 

With  RTS/CTS  handshake  turning  on  or  off  and  the  data  rate  for  all  nodes  setting  to  6Mbps  or  36Mbps,  we  made 
measurements  for  four  scenarios.  The  measured  throughput  and  packet  loss  rate  for  each  node  are  presented  in  Fig.  13 
and  Fig.  14,  respectively.14  These  two  figures  tell  us  three  things.  Firstly,  the  performance  of  close  stations  (node  B  and  node 
D)  are  much  better  than  that  of  far  stations  (node  A  and  node  E),  in  terms  of  both  throughput  and  packet  loss  rate.  This  is 
contrary  to  the  case  without  hidden  nodes,  where  all  nodes  enjoy  equal  chance  to  access  the  channel  and  the  throughput  for 
all  nodes  are  almost  the  same,  even  if  they  transmit  at  different  data  rates  [37], 

Secondly,  in  both  the  6Mbps  and  36Mbps  scenarios,  enabling  RTS/CTS  significantly  enhances  the  performance.  This 
contradicts  with  the  ad-hoc  case,  where  RTS/CTS  handshake  is  not  effective  in  dealing  with  the  hidden  node  problem  [26]. 

14As  in  [25],  we  also  experienced  a  high  variability  in  channel  conditions  thus  making  a  comparison  between  the  exact  values  of  the  results  difficult 
sometimes. 
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Fig.  12.  Topology:  All  stations  are  in  a  line,  with  access  point  in  the  middle. 


6M,  w/o  RTS  6M,  w/  RTS  36M,  w/0  RTS  36M,  w/  RTS 


Fig.  13.  Measured  Per-node  Throughput. 


This  is  because  for  a  given  distance  between  the  transmitter  and  receiver,  there  are  more  hidden  nodes  in  ad  hoc  networks 
than  in  single-cell  WLANs,  which  can  be  seen  clearly  by  comparing  Fig.  2  with  Fig.  3a. 

Thirdly,  comparing  the  Packet  Loss  Rate  (PLR)  in  scenarios  “6Mbps  without  RTS”  and  “36Mbps  without  RTS”,  we  find 
that  the  PLR’s  for  close  and  far  stations  in  the  former  scenario  are  about  20%  greater  than  their  counterparts  in  the  latter 
scenario.  This  agrees  with  our  analysis  very  well:  since  the  transmission  time  of  DATA  packet  at  6Mbps  is  much  longer  than 
that  at  36Mbps,  hidden  nodes  have  more  chances  to  initiate  transmissions  during  the  data  packet  transmission  period,  which 
gives  rise  to  more  collisions. 

As  we  know,  packet  loss  is  mainly  due  to  channel  errors  and  collisions.  As  mentioned  previously,  we  define  TX_Range 
as  the  maximum  transmission  distance  when  the  packet  loss  rate  is  maintained  below  10%.  Since  all  nodes  are  within  the 
TX_Range  of  of  the  access  point,  the  packet  loss  caused  by  channel  errors  is  less  than  10%.  From  Fig.  14  we  can  see  that 
most  measured  PLR’s  are  far  more  than  10%,  so  collisions  are  the  dominating  contributor  to  packet  losses. 

G.  Related  Work 

The  existing  research  on  evaluating  the  impacts  of  hidden  terminals  to  the  performance  of  802.11  networks  can  be  roughly 
categorized  as  follows.  There  are  simulation-based  studies  [24]  and  analytical  models  [38]-[23] .  A  few  papers  also  evaluate 
the  impacts  of  hidden  nodes  with  experiments  [23] [36].  Our  work  belongs  to  all  the  the  three  categories. 


6M,  w/o  RTS  6M,  w/  RTS  36M,  w/o  RTS  36M,  w/  RTS 


Fig.  14. 


Measured  Packet  Loss  Rate  (PLR). 
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Among  the  current  analytical  models,  [38]-[39]  are  the  closest  to  ours.  These  results  can  be  classified  into  two  groups,  based 
on  whether  they  utilize  the  relationship  between  the  transmission  distance  and  the  number  of  hidden  nodes,  which  is  revealed 
in  section  III-A  of  this  paper.  In  one  group,  the  papers  do  not  utilize  this  relationship.  In  both  [38]  and  [40],  Chhaya  and 
Gupta  build  an  analytical  model  for  802.1 1  WLAN  in  the  presence  of  hidden  nodes  and  capture  effects.  The  model  reveals  the 
unfairness  that  the  nodes  far  away  from  the  access  point  (AP)  have  a  less  success  probability  of  transmission  than  those  close 
to  the  AP.  However,  the  result  is  based  on  the  assumption  that  every  pair  of  nodes  are  hidden  from  each  other  with  a  constant 
probability,  which  holds  only  for  a  few  network  topologies.  Similarly,  [22]  evaluates  the  network  performance  by  building  a 
model  assuming  that  the  number  of  hidden  nodes  is  equal  for  all  transmissions.  In  the  other  group,  the  relationship  between 
transmission  distance  and  the  number  of  hidden  terminals  is  exploited.  Both  [41],  a  seminal  work  from  1984,  and  recent  work 
[39]  show  the  relationship  for  a  special  case  where  CS_Range  is  the  same  as  TX_Range,  though  CS_Range  is  usually  greater 
than  TX_Range  in  practice.  The  focus  of  both  papers  is  the  analysis  of  the  network  performance. 

Experimental  studies  of  the  hidden  terminal  problem  has  been  very  rare.  Ng  and  Liew  in  [23]  have  measured  the  aggregate 
network  throughput  of  a  6-node  ad-hoc  network  with  hidden  nodes.  It  does  not  conclude  the  performance  unfairness  problem. 

Our  paper  is  distinct  from  the  existing  work  on  the  hidden  terminal  problem  in  the  following  three  ways.  Firstly,  none  of 
them  answer  why  hidden  terminals  cause  the  performance  unfairness  problem.  Secondly,  no  effort  has  been  put  on  experimental 
measurements  to  validate  this  performance  unfairness  problem.  Finally,  our  model  is  one  of  the  few  models  that  focus  on  the 
performance  for  individual  nodes  rather  than  the  overall  network. 

IV.  Conclusion  and  Future  Work 

The  first  work  presented  an  on-line  adaptive  policy  for  stabilization  and  optimal  control  of  wireless  networks  operating  under 
rechargeable  batteries.  Using  a  Lyapunov  drift  argument  and  modifying  the  framework  of  [15]  in  subtle  but  non-trivial  ways, 
a  performance  bound  was  provided  that  ensures  asymptotic  optimality  as  Emax  — >  oo  (or  Emax/P  — >  oo,  to  be  exact).  The 
policy  requires  only  current  information  and  is  particularly  suitable  for  satellite  or  wireless  networks,  which  typically  operate 
under  high  Emax/P  ratios. 

Future  work  includes  extensive  numerical  evaluation  of  the  proposed  policy,  in  order  to  develop  practical  rules  for  selecting 
the  Emax/P  ratio  (i.  e.  how  large  should  the  ratio  be  so  that  the  performance  loss  is  negligible),  since  the  theoretically  derived 
bounds  are  rather  loose.  The  basic  intuition  may  also  be  applicable  to  finite  (non-rechargeable)  batteries  so  that  a  nearly  optimal 
policy  may  be  developed  for  the  problem  of  lifetime  maximization.  This  problem  is  expected  to  require  a  completely  new 
formulation  and  tools,  and  is  of  a  more  speculative  nature. 

In  the  second  part  of  this  project,  we  studied  performance  unfairness  in  802.11  networks  due  to  hidden  nodes.  We  build  an 
analytical  model  and  validated  it  with  simulations.  We  also  set  up  a  testbed  and  made  experimental  measurements.  From  our 
analysis,  simulation  and  measurements,  we  conclude  that  the  performance  of  a  node  in  a  randomly  distributed  802.11  WLAN 
depends  on  the  distance  from  the  node  to  the  AR  More  specifically,  the  nodes  close  to  the  AP  get  much  better  performance 
than  those  far  from  the  AP.  Although  our  work  is  based  on  802.11  networks,  we  believe  it  also  applies  to  most  random-access 
radio  networks. 

As  future  work,  we  are  planning  to  extend  the  analysis  for  ad  hoc  networks.  We  will  also  design  a  MAC  layer  scheme  to 
mitigate  the  performance  unfairness. 


V.  Appendix 


A.  Proof  of  Lemma  2 

We  will  need  the  following  result  from  analysis 

Lemma  V-A.l:  For  any  functions  /i(t),  /2(f))  w{t)  such  that  fi(t)  +  w(t)  =  /2(f)  and  limt—^  w(t)  =  0,  it  holds 

liminf  f\(t)  =  liminf  /2(f),  (119) 

t—>  00  t—>o o 

with  a  similar  equality  for  limsup. 

Proof:  Since  w(t)  — ►  0  as  t  — >  00,  it  follows  that  (— w(t ))  — >  0  as  t  — ►  00  and,  hence,  lim  inf  (_>,*)  w(t)  =  0, 
lirn  inf't^oo  (—w(t))  =  0.  Using  standard  properties  of  liminf,  we  have 


liminf  fi(t)  =  liminf  (/2(f)  +  (—w(t)))  >  liminf  f2(t )  +  liminf(— w(t))  =  liminf  /2(f), 

t— >oo  t—>  00  t — too  t—>  oo  t — > oo 

liminf  f2(t)  =  liminf  (/i(t)  +  w(t))  >  liminf  fi(t)  +  liminf  w(t)  =  liminf  fi(t), 

t—>  00  t—>o o  t—>  oo  t—>  oo  t—>  oo 

which  completes  the  proof. 

Following  the  approach  in  [16],  we  now  take  an  expectation  of  (17),  which  produces 


E  [Ly  (X(k  +  1))  -  Ly  (X(k))}  -  VE  [g  ( Z(k ))]  <  C  -  Vg*  +  E [Q(k)}, 


(120) 


(121) 
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and  sum  the  above  telescoping  series  for  k  =  0, , . . ,  f  —  1  to  arrive  at 

t- 1  t-i 

E  [Ly  (X(f))]  -  E  [Ly  (X(0))]  —  V  E  E  [g  (. Z(k ))]  <  (C  -  V<?*)f  +  £  E[Q(fc)]. 


(122) 


/c— 0 


/c— 0 


Since  Ly(X(t))  >  0,  the  above  inequality  can  be  strengthened  to 


t- 1 


t- 1 


(IV  -C)t<J2  EIQ  V]  +  E  [V  (*(0))]  +VY,El9  (Z(k))} 


k—0 
t- 1 


k—0 


t- 1 


=v  -  f  -  77  E  ^  E  [Ly  (*(«))]  +  7  E  E  [5  (^(*))]  • 

1  fe=0  ^  fc— 0 


(123) 


It  also  holds 


E  E  id  (Z(k))\  <  7  E  5  (E  [Z[k)\)  <2  (7  EE  ’ 


f 


(124) 

fc= 0  "  fc= 0  \ '  fe=0  ) 

where  the  first  inequality  is  due  to  Jensen’s  inequality  and  the  second  one  follows  from  the  concavity  of  g.  The  reader  will 
recognize  the  argument  of  g  in  the  last  RHS  as  z(t).  Combining  (123),  (124)  yields 

t-i 


9*  -  y  -  Yt  ^  E[Q{k)]  -  177 E  [Ly  (x(0))]  +  9  {Ht))  • 


fc= o 


Vt 


(125) 


We  now  take  a  lirninf  in  the  last  relation,  exploit  the  properties  lim  inf  77  >  J^liminf  and  liminf(— xn)  =  —  limsupxn, 
and  use  Lemma  V-A.l.  Since  E  [Ly  (X(0))]  /{Vt)  — »  0  as  t  — >  cxd,  it  finally  holds 


9*  -  77  -  T7  lim  sup  J  E  [Q{k)\  <  lirninf  g  ( z{t )) . 
V  V  t— oo  t  f—f  i^oo 

k—0 


(126) 

□ 


B.  Proof  of  Lemma  3 

The  deterministic  bounds  of  (23)  are  proved  as  follows.  Examine  (12)  and  note  that  it  is  a  separable  concave  maximization 
over  a  compact  set.  Separability  allows  us  to  maximize  individually  over  l  so  we  seek  the  maximum  of  V fi{"fi)  —  rjYi{f)'yi. 
It  clearly  follows,  by  computing  the  first  derivative  V f/i'Yi)  —  ijYi(i),  that  if  it  holds  V f{ (0)  —  pYi{t)  <  0,  the  function 
V fiOn)  —  riY[(t)ji  is  non-increasing  (since  concavity  of  f)  implies  f'{"fi)  <  //( 0))  and  therefore  the  maximum  is  achieved  at 
7 1  =  0. 

The  bound  for  Yf  t)  is  now  proved  by  induction.  The  bound  is  trivially  true  for  t  =  0.  Assume  now  that  it  holds  for  some  t, 
i.  e.  Yi{t)  <  V f [ (0) / 77  +  Ai  and  distinguish  cases.  It  holds  either  Yi(t)  <  V f 1(0)/ rj  or  V fl(0)/rj  <Yi(t)  <  V f ((0)/ rj  +  Ai.  In 
the  former  case,  (9)  and  the  bound  7 i(t)  <  Ai  immediately  imply  the  bound  for  Y[(t  +  1),  while  in  the  latter  case  we  exploit 
the  fact  that  in  this  range  of  Y;(t)  it  holds  7 i(t)  =  0  so  that  Yi(t  +  1)  <  Y(t)  and  the  bound  is  again  proved. 

For  the  bound  on  Ui(t),  note  that  Ui(t)  >  rjYi  implies  Ui(t)  >  rjYi(t),  whence  it  follows,  from  separability  of  (11)  and 
its  “bang-bang”  solution,  that  Ritin(t)  =  0.  The  bound  is  now  proved  by  induction  on  t  and  distinction  of  cases,  in  a  similar 
manner  as  in  the  previous  paragraph.  Finally,  for  the  bound  on  D(t),  we  exploit  the  assumptions  for  <7  f.s.  p)  in  Section  II-A 
and  write  for  all  t  and  p  £  Y S(tj 


Ui(t)ci  (5(f),  p)  -  D(t)pi  <  Ui(t)ci  (5(f),  (0, ...  ,pi, ...  ,0))  —  D(t)pi  Ml 


(127) 


=^E  iui(0ci  (S(t),p)  -  D(t)pi\  <  [ Ui(t)ci  (5(f),  (0,  ...,pi,...,  0))  -  D(t)pi] . 

1  =  1  1  =  1 

Consider  the  maximization  of  the  last  RHS  in  (127)  over  the  set  |p  G  '■  Pi  <  min  ^5(f),P^  Vf|,  which  is  actually 

a  superset  of  the  constraint  set  of  (13),  and  denote  its  solution  as  P(t).  The  new  problem  is  separable  and  a  repetition  of 
the  derivative  argument  of  the  previous  paragraphs  reveals  that  it  holds  P/(f)  =  0  when  Ui(t )^  (5(f),  0)  <  D(t).  Hence, 

D(t)  >  maxi <i<l  (UiCi'j  implies  P(t)  =  0,  which,  combined  with  (127),  yields 

L 

E  [UliOci  ( S(t),p )  -  D(t)pi\  <  0. 


;= 1 


(128) 
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On  the  other  hand,  the  last  inequality  becomes  an  equality  when  p  =  0.  Hence,  with  respect  to  the  original  problem  in  (13), 
it  holds  Pft)  =  0  whenever  Dft )  >  maxi<;<£  (UiCi'j.  Using  this  statement  in  combination  with  (10),  the  bound  for  D(t)  is 
proved  by  induction  on  f,  similarly  to  the  previous  bounds. 

□ 


C.  Proof  of  Lemma  4 

It  obviously  holds  poutft)  <  poutft)  for  all  t.  This  implies  poutft)  <  poutft),  which  combined  with  the  monotonicity  of  cj 
and  the  liminf  operator  immediately  produces  the  second  inequality  of  Lemma  4.  Hence,  it  remains  to  prove  the  first  equality.  It 
is  shown  in  [15]  that  for  any  stable  queue  Zft)  with  E[Z(0)]  <  oo  and  a  bounded  service  process  pout{t)  (i.  e.  poutft)  <  Cl 
for  a  sufficiently  large  C)  it  holds  lim^oo  E[.Z(i)]/t  =  0.  It  also  holds 

Z( 0)  +  E  Ain(r)  =  Zft)  +  £  pout(r)  =*  E[Z{0)  ~  Z(t)]  +  ain(t)  =  fiout(t),  (129) 

r— 0  r=0 

where  the  =>■  part  follows  from  taking  expectations  and  dividing  by  t.  Since  E[Z(0)  —  Zft)\/t  — >  0,  it  follows  that  for  all 
e  >  0  there  exists  some  T(e)  such  that  for  all  t  >  T(e)  it  holds 

O-init)  —  el  <  p,outft),  (130) 

P'outW  ~ 

If  we  pick  e  <  C,  (130)  implies  din(t)  <  2 C  for  all  t  >  T(e).  Consider  now  the  function  g  with  domain  the  L-dimensional 
cube  of  length  2 C,  i.  e.  g  ■  xf=i  [0  2C]  h— >  E.  Since  the  domain  set  is  compact  and  g  is  continuous,  the  Heine-Cantor  theorem 

asserts  that  g  is  uniformly  continuous  in  its  domain.  Hence,  for  any  ei  >  0  there  exists  an  e  (depending  on  e\  only  and  not 
x)  such  that 

\g(x)  -  g(x  -  el)\  <  a  =>  g(x  -  el)  >  g(x)  -  ex  Vx.  (131) 


We  now  set  x  =  a,„(f)  and  x  =  fiout(t )  in  (131)  (these  x  obviously  belong  to  the  domain  set  of  g  for  t  >  T(e))  so  that  the 
previous  relation  becomes 

g  (aj„(f)  -  el)  >  g  ( ain(t ))  -  ei, 

g  iKutit)  el  )>g  (p-outi*))  -  £i> 


(132) 


for  all  t  >  T(e).  Combining  the  monotonicity  of  g  with  (130),  (132),  it  follows  that  for  a  given  e\  >  0  there  exist  e(ei)  and 
T(e)  such  that  for  all  t  >  T(e)  it  holds 


g  ( ain{t ))  -  ei  <  g  ( ain{t )  -  el)  <  g  (, fiout(t )) , 

g  (Uouti*))  -  £i  <  g  (Kut(t)  - el)  <  g  (ain{t)) . 

Taking  a  liminf  in  the  last  equations  produces 

liminf  g  (din(i))  -  ei  <  liminf  g  ( fiout(t )) , 

t—>  oo  t—>  OO 

liminf  g  ( pout{t ))  -  a  <  liminf  g  ( am(t )) , 

t — ►oo  t — >-oo 

and  using  the  fact  that  ei  is  arbitrary  completes  the  proof. 


(133) 


(134) 


□ 


D.  Proof  of  Lemma  5 

We  assume  w.  1.  o.  g.  that  the  lost  recharge  energy  quanta  of  Fig.  1  enter  a  virtual  queue  Eover(t)  of  zero  service  rate,  so 
that  Eover(t)  is  a  non-decreasing  function  of  time.  Viewing  the  queues  E(t),  D(t),  Eover(t)  of  Fig.  1  as  a  single  compound 
queue,  this  compound  is  unstable  since  it  has  an  instantaneous  arrival  rate  of  B(t)  and  a  corresponding  service  rate  of  at  most 
(1  —  6)B(t).  However,  queues  E(t),  D(t)  are  finite,  the  former  by  definition  and  the  latter  due  to  Lemma  3,  which  implies 
that  lim^oo  Eover(t)  =  oo  w.  p.  1.  Since  Eover(t)  is  increased  only  when  Eft)  overflows,  it  follows  that,  under  DRABP,  the 
queue  Eft)  overflows  infinitely  often  so  that 


Pr 


(OO  OO  \  /  oo  oo 

n  uw) = = 1  ^ pr  u  n w)  <  i  =  °- 

i=0j=i  J  \t=0  j—t 


(135) 


The  RHS  of  (135)  also  implies  Pr  (fl?Lt{.E(j)  <  Emax\)  =  0  for  all  t.  We  define  the  random  variable  f  =  infjf  :  Eft)  = 
Emax}  as  the  first  time,  under  DRABR  the  battery  hits  Emax  (obviously,  f  =  0  if  -E(O)  =  Emax).  Clearly,  f  is  finite 
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w.  p.  1  since  Pr  (f  =  oo)  =  Pr  (n}T0{f?(j)  <  Emax})  =  0,  due  to  the  implication  of  (135).  We  also  construct  the  decreasing 
event  sequence  A?  =  {f  >  j},  so  that  it  holds  r\JL0Aj  =  {t  =  oo}.  From  standard  probability  theory  we  know  that 
Pr(Aj)  — »  Pr(f  =  oo)  =  0  as  t  — >  oo. 


We  now  exploit  limsup  properties  to  write 


limsup  -  V  Pr  (E(j)  <  P\  =  limsup  -  Pr  (E(j) 
t.—>  oo  t  .  „  V  /  t— >oo  t  ...  I  ' 


<  P,T  <  j)  +  Pr 


2=0 


3=0 


(m 


<P,T>j 


1  *_1 

<  limsup  -  V''  Pr  (E(j)  <  P , 

t — >00  t  .  „  \ 


3=0 
t- 1 


t- 1 


f  <  j  )  +  limsup  -  Pr  (E(j)  <  P,t  >  j 
t—>  oo  t.  V 
2=0 

t—1 


(136) 


<  limsup  -  Pr  (E(j)  <  P,t  <  j)  +  limsup  -  Pr  (f  >  j ) . 
t—>00  t  V  2  t—>00  t 

3=0  2=0 

Since  Pr(f  >  j)  — ►  0  as  j  — >  oo,  it  follows  that  for  any  e  >  0  there  exists  a  J  such  that  Pr(f  >  j)  <  e  for  all  j  >  J.  Hence, 
for  t  >  J  it  holds 

J 

I  , — ,  ./-HI  T.  —  .1  —  I 

(137) 


1  V— '  _  ,  _  ..  1  V— '  .,  1  \  '  "v  .7+1  t  —  J  —  1 

-  yj  Pr  (T  >  j)  =  7  yj  Pr  (T  >  j)  +  7  yj  Pr  (r  <  j)  <  — —  + - - - e 


2=0 


t 


2=0 


t 


2=J+1 


f 


t 


Taking  a  limsup  in  the  last  equation  and  exploiting  the  fact  that  e  is  arbitrary  yields  limsupt^00(l/f)  Pr(f  >  j)  =  o, 

so  that  we  only  need  to  compute  the  left  limsup  in  the  last  line  of  (136).  We  will  now  prove  the  following  bound  for  all  j 


,  „  \  ?  /  7-1 

Pr  ( E(j )  <  P,t  <  j)  <  Pr  yy  B(k)  < 

\  k  =  Tn 


P  ~\~  D  —  En 


f  a  — l 


<  Pr  (  < 


P  +  D  —  Err 


(138) 


\k=  0 


where  Tj  =  sups<J-{s  :  E(s)  =  Emax }  is  the  last  time  up  to  (and  including)  j  when  the  battery  hit  Emax.  This  quantity  is 
well  defined  for  all  j  with  f  <  j. 

If  r  =  j,  then  tj  =  j,  E(j)  =  Emax  and  {E(j)  <  P,  t  <  j}  =  0,  so  that  (138)  holds  trivially.  Consider  now  the  event 
{E(j)  <  P,f  <  j}.  It  holds 


{m 


E  —  P 

max  1 

*  i 

j  c  <3  -  Tj  > 

P 

(139) 


since  the  fastest  way  the  battery  can  drop  from  Emax  (its  value  at  Tj)  to  E(j)  is  by  transmitting  with  peak  power  and  receiving 
zero  recharge  for  all  intermediate  slots.  Examining  the  combined  queue  E(t),  D(t)  in  the  interval  [tj  j ]  yields 

2-i  2-i 

E(j)  +  D(j)  -  E(Tj)  -  D(Tj)  >  5  yy  B(k)  =>  E(j)  +  D  —  Emax  >  6  Y  B(k),  (140) 


k=Tj 


k=Tj 


where  the  last  RHS  is  inferred  by  the  boundedness  of  D(t).  Note  that  (140)  holds  for  any  j  such  that  E(j)  <  Emax  and 
f  <  j.  Hence,  the  following  inclusion  is  true 


2-1 


{E(j)  <  P,f  <  j}  c  ^  P  +  D  -  Emax  >  6  yy  B(k)  \  , 


k=r 


and,  due  to  (139),  it  also  holds 


2-1 


Tj+a-1 


P  +  D-  Emax  >  5  yy  B(k)  \  cl  P  +  D~  Emax  >  5  y,  B(fc) )  ■ 


(141) 


(142) 


k=Tj 


k=Tj 


Combining  (141),  (142),  taking  probabilities  in  the  resulting  inclusion  and  using  the  stationarity  of  B(t)  produces  (138).  Since 
the  RHS  of  (138)  is  independent  of  j  (and  therefore  invariant  under  a  convex  combination),  the  proof  is  complete. 


□ 
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E.  Proof  of  Lemma  6 

It  is  easy  to  show  through  simple  algebra  that 

Pr  B{k)  <  P  +  D~E™A  =Pr(/i  J2lB(k)  —  B]<  PpD~E™*  _  b\  .  (143) 

In  the  selected  regime  for  Emax,  it  holds  (P  +  D  —  Emax) / (oS)  —  P  <  0  so  that  we  can  apply  Chernoff’s  bound  [42]  in  the 
form  we  state  below 

Lemma  V-E.l:  For  a  sequence  X±,X2,  ■  ■  ■  of  iid  random  variables,  it  holds  for  all  n  and  a  <  E[xi] 

Pr  (xi  +  X2  +  ■  ■  ■  +  xn  <  na)  <  exp  (—n£(a)) ,  (144) 

where  £(a )  =  sup0  ( da  —  lnE[eSa:i]).  The  following  properties  hold  for  £{•) 

•  £(a)  >0  Va. 

•  a  —  0  £(a)  =  0. 

•  £(a)  is  a  convex  function  of  a. 

Constructing  the  iid  random  variables  Xk  =  B(k)  —  B  (with  E[xfc]  =  0)  and  substituting  a  for  n  in  the  above  lemma 
immediately  produces  the  desired  result. 

□ 


F.  Proof  of  Lemma  7 

We  assume  that  Emax  >  P  +  D  and  prove  Lemma  7  by  standard  induction  starting  from  t  =  f.  The  hypothesis  is  trivially 
true  for  t  =  t,  since  it  holds,  by  definition,  E(t)  =  Emax  and  therefore  E(t)  +  D(t)  >  Emax.  We  now  assume  that  for  some 
t  it  holds  E(t)  +  D(t)  >  Emax,  and  distinguish  cases  for  E(t  +  1) 

•  if  Eft  +  1)  =  Emax,  it  follows  immediately  that  E(t  +  1)  +  D(t  +  1)  >  Emax. 

•  if  E(t  +  1)  <  Emax,  we  use  (3),  (10)  to  write 


E(t  +  l)  =  E(t)-J2m)  +  B(t), 
1=1 

L 

D(t+  1)  >  D(t)  -  (1  -  S)B{t)  + 

i=i 


(145) 


and  add  the  previous  relations  by  parts  to  derive 


E(t  +  1)  +  D(f  +  1)  >  E(t)  +  D(f)  +  1 5B(t)  P  E(f)  +  D(f)  >  Emax, 


(146) 


where  the  last  inequality  follows  from  the  inductive  hypothesis  for  t. 

Hence,  the  hypothesis  is  also  true  for  t  +  1  and  Lemma  7  is  proved. 

As  mentioned  in  Section  II-B4,  the  fact  that  the  battery  will  eventually  hit  Ernax  in  finite  time  and  never  fall  below  P 
afterwards  suggests  that  the  finite  portion  of  time  before  the  first  hit  does  not  affect  the  infinite  horizon  utility.  A  rigorous 
proof  of  this  requires  the  following  intermediate  facts 

Lemma  V-F.l:  Let  f  a  random  variable  taking  values  in  N  with  E[f]  <  oo  and  a  stochastic  process  Z{t)  such  that 
0  <  Z(t)  <  Z  a.s.,  where  Z  is  an  arbitrary  constant.  It  then  holds 


7 E Eiz^ - ^ ^ < 7 E z + fc)] eE Eiz^  +  Z^r  > °- 


k= o 


k= 0 


Proof:  It  holds 


r+t— 1 


t- 1 


t 

k= 0 

t+r— 1 


r— 1 


so  that 


J2z(r  +  k)=  E  Z^  =  E  Z(k)  +  E  ^(fc)-E  Z(k)  Vf, 

k= 0  k=r  k= 0  k=t  k= 0 

1  t- 1  f-1  t- 1  f-1  t- 1 

E^(r+fc)>E  Z^k)  -  E  zw  ^  E  zw  -  E  Z  =  E  z(fc)  -  z^ 


t- i 


k= 0 


k=0 


k=0 


k= 0 


k= 0  k= 0 


E^(r+fc)<Ez(fc)+  E  z(k)<j2z(k)+  E  z  =  Y.zw  +  zi- 


(147) 

(148) 

(149) 


(150) 
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Taking  expectations  in  the  last  two  equations  and  dividing  by  t  completes  the  proof.  ■ 

Combining  Lemmas  V-A.l,  V-F.l  produces  the  following 

Corollary  4:  For  any  random  variable  t  taking  values  in  N  with  E[f]  <  oo  and  stochastic  process  Z(t )  such  that  0  < 
Z(t)  <  Z  a.s.,  it  holds 

,  t- 1  ,  t- 1 

liminf  —  E  [Z(f  +  k )]  =  liminf  — 


k—0 


t  >-oo  t 


(151) 


k- 0 


The  following  result  is  the  key  to  proving  our  intuitive  claim 

Lemma  V-F.2:  Let  t  =  inf  {t  :  Eft)  =  Emax }  be  the  first  time  the  battery  hits  Emax  under  DRAB P.  It  then  holds  E[f]  <  oo. 
Proof:  We  use  the  well-known  identity  for  discrete  random  variables 

OO 

E[t]  =  5ZPr(f  >  •?')>  (*52) 

3=0 

and  examine  the  joint  queues  (7(f),  D(t)  in  the  interval  [0  j).  Since  the  battery  has  never  overflowed  up  to  time  j,  it  holds 


3- 1  3-1 

E(j)+D(j)  -  E( 0)  -  D( 0)  >  6^2  B(k)  =►  Emax  +  D>  6^2  B{k), 

k— 0  k— 0 


so  that  the  following  inclusion  is  true 


3~1 


{t  >  jj  C  <  Emax  +  £>  >  S  y  B(k)  >  . 


f  fc= o  ) 

Taking  probabilities  in  the  last  inclusion  and  performing  some  algebra  similar  to  Appendix  V-E  yields 

3~1  \ 


Pr  (f  >  j)  <  Pr 


JS 


j 


(153) 


(154) 


(155) 


k—0 


Hence  for  j  >  f,  where  j*  =  1  +  \{Emax  +  b)/(SB)],  the  LHS  of  the  inequality  in  the  second  probability  becomes 
negative  so  that  we  can  apply  Lemma  V-E.l  and  exploit  the  monotonicity  of  £(■)  to  get 


Pr  (f  >  j)  <  Pr 


En 


j-i 


jS 


—  -B>-'^2JB{k)  <  exp  (~jf(a*)) 


(156) 


k= 0 


where  a*  =  ( Emax  +  b)/(j*S)  —  B  <  0.  Having  provided  an  exponentially  small  tail  bound,  it  easily  follows  from  a  standard 
procedure  that  E[f]  <  oo. 

Combining  Corollary  4  with  Lemma  V-F.2  finally  proves  the  original  claim.  ■ 


□ 


G.  Proof  of  Lemma  11 

The  boundedness  of  lj^c^(f)  has  already  been  established  in  the  proof  of  Lemma  10.  Also,  if  Un°\t)  is  finitely  bounded  for 
all  n,  c,  then  repeating  the  procedure  of  Appendix  V-B  for  l)n  ( t )  provides  the  bound  of  (94).  Hence,  it  remains  to  prove  the 
bound  for  Although  it  follows  from  (87)  that  f)  =  0  whenever  Un\t)  >  r]YnC\  the  boundedness  of  U„\t) 

cannot  be  proved  solely  through  the  argument  of  Appendix  V-B  due  to  the  additional  term  p\fn  (S(t),  P(t))  of  intra-node 
incoming  traffic.  Hence,  a  new  approach  is  required. 

To  prove  the  desired  result,  we  can  equivalently  prove  the  following 

Lemma  V-G.l:  For  any  policy  whose  application  satisfies  the  conditions  of  Lemma  10  (clearly,  NRABP  is  such  a  policy), 
there  exists  a  finite  sequence  a^\  . . . ,  such  that  it  holds  for  all  f,  c 

max  y  U^c\t)  <  kFW  +  af\  (157) 

\T\=k  *eX 

where  F ^  =  p  maxi<„<jv  Yn^  and  \I\  denotes  the  cardinality  of  I. 

Note  that  the  maximum  is  taken  over  all  sets  of  given  cardinality.  The  above  statement,  if  true,  implies  that  U-c\ t)  < 

N E(c'>  +  a^\  which  in  turn  implies  the  desired  boundedness.  Lemma  V-G.l  will  be  proved  by  induction,  and  the  following 
facts  will  be  used  in  the  process 
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Remark  1:  Consider  any  policy  that  places  an  explicit  upper  bound  on  the  exogenously  admitted  traffic  R„\n{t)  for  all 
nodes  n  and  commodities  c  (clearly,  NRABP  is  such  a  policy).  Then,  there  exists  a  sufficiently  large  C  such  that,  under  this 
policy,  it  holds  for  all  sets  T  C  {1, . . . ,  iV} 

<  J2UiC\t)  +  ^  Vce/C,  (158) 

iei  iei 


where  1C  is  the  commodity  set. 

Remark  2:  Consider  any  index  set  I  C  {1, . . . ,  N}.  This  set  can  be  uniquely  partitioned  as  T  =  1+J^-=1X/  where 

7  A  /  0  if  1  0  X, 

1  \  {l,2,...,iei}  if  {1,2 . *ei}  C  lAiei  +1  gX,  U  J 

and  the  remaining  sets  are  defined  recursively  through 

-^7  =  {*/j  '  hj  +  !)•■•>  }  (160) 


where  ie.  and  if .  are  uniquely  defined  by  the  following  conditions 

•  fj  >  e.j-i  +  2  (if  Ji  =  0,  we  define  e\  =  0). 

•  +  1,  •  • ,  ifc  —  1}  C  If  where  c  denotes  set  complement. 

Furthermore,  for  such  a  decomposition  we  can  define  the  set 

g  A  f  0  if  l  =  1, 

\  {iei  +  1 ,...  ,if2  —  1}  otherwise, 


(161) 


Remark  1  is  true  due  to  the  fact  that  link  rates  are  finitely  bounded,15  while  Remark  2  is  a  notational  description  of  the 
obvious  fact  that  any  index  set  can  be  uniquely  decomposed  into  non-overlapping  blocks  of  consecutive  indices  starting  from 
low  numbered  indices,  as  shown  in  Fig.  15  (black  nodes  belong  to  I  while  white  ones  don’t;  two  different  examples  are  given 
depending  on  whether  1  el).  The  set  Q  is  essentially  the  set  of  indices  located  between  the  first  and  second  blocks. 
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Fig.  15.  Schematic  decomposition  of  Z  into  Xj.  In  the  second  case,  it  holds  X\  =  0. 


The  following  result  will  also  be  used 

Lemma  V-G.2:  Let  a  non-increasing  finite  sequence  X\  >  X2  >  . . .  >  Xjy  such  that  it  holds  5X=i  x*  —  +  in  for  all 

n  =  1, . . . ,  N.  For  consistency,  define  ao  =  0.  Then,  for  any  index  subset  I  C  {1, . . . ,  N}  consisting  of  k  elements  it  holds 


Xj  <  kH  + 
iei 


n  2  -  k 
ri2  —  n\ 


®ni 


k  —  rii 
ri2  —  n\ 


an2 ) 


062) 


where  k  =  \I\,  n\  =  \I\\,  m  =  |G|,  712  =  m  +  k  and  I\,  Q  are  defined  in  Remark  2.  The  above  inequality  is  meaningful  when 

?i2  >  iii- 


Proof:  We  explicitly  use  the  notation  of  Remark  2  and  distinguish  cases  as  follows 
1^1.  Then,  by  definition  it  holds  I\  =0,  n-\  0  and  the  desired  inequality  becomes 

k  7,  ,  ev  .  k 


Xi  <  kH 


m 


■  a0 


iei 


'  1m+k  ^  E  Xi  <  kH  + 

te  1 


1  dm+k' 


063) 


15in  fact,  it  is  easy  to  show  that  (158)  is  true  for  any  C  >  rnaxxc{i,...,iv}  nraxc  E^gz  +  E]a 
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If  l  =  1  so  that  to  =  0,  we  have  to  prove  Yhiei  xi  —  kH  +  ak ,  which  is  true  by  assumption.  For  l  >  1,  since  X\  —  Z  and 
xn  is  a  non-increasing  sequence,  the  arithmetic  means  of  non-overlapping  index  sets  satisfies  a  similar  inequality,  i.  e. 


1 

TO 


E*<  - 

ieS 


E  ^ 

*ei 


Em 

x'-~ 


E*- 

iei 


The  monotonicity  of  x,  also  implies 


m+fc 

Xi  <  E  Xi  $  (m  +  &)#  +  «m+fc> 

2£C?  i=l 


(164) 


(165) 


where  the  second  inequality  holds  by  assumption.  Combining  the  last  two  equations  yields 

k 


(y  +  ■*■)  -  (m  +  *0-^  +  am+k  =>  E Xi  <  k.H 


2GX 


iex 


m  - 


'  &ra+/c  5 


(166) 


so  that  (162)  is  proved. 

•  1  S  I,  so  that  X\  /  0.  The  case  l  =  1  is  immediately  eliminated  since  it  implies  m  =  0  and  ri2  =  ni  =  fc.  Hence,  we 
assume  l  >  1  so  that  the  arithmetic  means  inequality  is  still  applicable 

—  V'a;*  >  7—^ —  E  Xj=>y^Xj>  m  Y  Xi,  (167) 

to.  '  k  —  n i  z — '  z— '  k  —  n\  z — ' 

teg  iex2i±i...i±iii  teg  ieX2tti...ttiXi 

and  since  it  holds  _ 

E Xi  +  E  Xi  <  (to  +  fc)g  +  qm+fc,  (168) 

iGX  iGC? 


combining  the  last  two  equations  yields 


Y,Xi  +  k~^  12  xi<{m^+k)H+am+k^y/xi  +  -j-^—^yXi-y  Xi  J  <  (to  +  k)H  +  am+k 


tex 


1  + 


iex2u...ax; 


»  gx 


k  —  ni 


Ki  GX  iGXi 


k  —  n\ 


yXi<{m  +  k)H  +  — —  E  Xi  T  . 


iGX 


iGXi 


(169) 


Using  the  fact  that  x,  <  n\H  +  ani  and  performing  simple  algebra  in  the  last  equation  results  in  (162). 

■ 

(c)  (c) 

We  are  now  in  position  to  prove  Lemma  V-G.  1  inductively,  so  that  we  are  essentially  looking  for  a  sequence  a\  , ... ,  aN 
that  satisfies  (157).  Clearly,  the  proof  is  essentially  complete  if  we  find  such  a  sequence  for  a  specific  c.  Hence,  in  the  remainder 
of  the  proof,  we  drop  the  c  dependence  on  all  quantities  (i.  e.  we  write  C/j(f),Ofe  etc.  instead  of  U^c\t),a^).  Eq.  (157)  is 
trivially  true  for  1  =  0,  since  all  queues  are  initially  empty,  provided  that  a  f1  >  0.  We  assume  that  (157)  holds  for  t  and  we 
examine  what  happens  at  1  +  1. 

Pick  any  set  X  C  {1, . . . ,  TV}  and  decompose  it  as  X  =  XUX,  where  X  =  {*  €  X  :  Ui(t)  >  F}  and  X  =  {i  £  X  :  U%{t)  <  F}. 
We  also  assume,  w.  1.  o.  g,  that  U\(t)  >  U^t)  >  ...  >  C/jv(1)  and  distinguish  cases: 

•  it  holds  1/0.  Using  Remark  1,  the  definition  of  X  and  the  inductive  hypothesis  on  S  yields 

E  uic)  (*+!)<  E  uic)  e t)+c  =  y  y(c)  w + e  ^c)  w  +  £ 

tex  *ex  iex  iGX  (170) 

<  |J|F(c)  +  a{±{  +  |I|F(c)  +  C  <  \X\ F(c)  +  0|i|  +  C  <  |I|F(c)  +  om. 

Since  1/0,  the  hypothesis  is  true  for  1  +  1  provided  it  holds  a^+i  >  +  C  for  all  k  =  1, . . . ,  TV  —  1. 

•  it  holds  X  =  0  and  the  decomposition  of  X  according  to  Remark  2  consists  of  a  single  block  fi.  e.  I  =  1  and  X  =  X  = 
Xi).  Since  X\  contains  consecutive  indices,  it  contains  the  k  =  ii\  largest  backlog  indices.  Additionally,  the  policy,  by 
construction,  allows  bit  transfers  only  from  higher  backlog  nodes  to  lower  backlog.  Hence  any  incoming  traffic  to  a  node 
of  X  must  originate  from  another  node  in  X,  which  implies 

yUi{t  +  l)  =  yUi(t  +  l)  <yUi(t)  <  \X\F  +  a]±l  =  \X\ F  +  am, 

teX  iGX  iGX 

where  the  last  inequality  holds  due  to  the  inductive  hypothesis  for  t. 


(171) 
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it  holds  1=0  and  the  decomposition  of  X  consists  of  two  blocks  or  more  (i.  e.  I  >  2).  Using  Remark  1  and  Lemma  V-G.2 
yields 

7  7  <7 

'y  '  Ui(t  +  1)  5:  'y  '  Ui(t)  +  C  <  kF  H - — - —  ani  4 — - an2  +  C  <  kF  +  a^,  (172) 


n2  -  ni 


n2  -  ni 


where  k  =  \X\  and  m,  n2  are  defined  in  Remark  2.  Hence,  the  inductive  hypothesis  is  true  for  t  +  1  if  it  holds 


ak  > 


max 

ni,n2 
0<ni<fc<7i2<iV  L 


122  —  k  k  —  12i 

@ni  T  t2n2 

n2  —  m  U2  -  n i 


c. 


(173) 


Gathering  the  previous  conditions.  Lemma  V-G.l  is  proved  if  there  exists  a  sequence  a\, . . . ,  a at  that  satisfies  the  following 
conditions 


ak  >  0  Wk  =  0,...,N, 

(tfe+i  >  ak  +  C  Vfc  =  0, . . . ,  N  —  1, 


ak  >  max 
»1,»2 
0<ni<k<n2<N  L 


122  —  k  k  —  12 1 

“h  Un2 

n2  —  m  122  —  n  i 


(174) 


+  C  Vfc  =  1, . . . , TV  —  1, 


where  we  define  a o  =  0.  Since  the  coefficients  of  ani,  an,2  form  a  convex  combination,  a  sequence  satisfying  (174)  can  be 
easily  constructed  as  follows.  Pick  any  strictly  increasing  and  strictly  concave  sequence  6o,...,6jv  with  ha  =  0  so  that  it 
holds16 

122  —  k  k  —  12i 

"  ttn2  ) 


hk  >  max 

ni,n2 

0<ni<k<n2<.N 


122  —  Tl  1 


n2  -  ni 


(175) 


and  define 


Cl  =  min  [6fe+i  h\ , 
0<fe<JV-l 


C2  =  mm 
KfcCJV-l 


bk  - 


max 

711,712 
0<ni<fc<ri2<A^  L 


122  —  k 


k  —  n\ 


i  i  ^712 

n2  —  ni  n2  —  ni 


(176) 


The  strict  monotonicity  and  concavity  imply  that  ci,c2  >  0.  It  now  follows,  after  some  simple  algebra,  that  the  sequence 

C 


hk  .  ,  , 

mm(ci,  c2) 

satisfies  (174),  so  that  the  hypothesis  is  true  for  t  +  1  and  the  induction  is  complete. 


(177) 


□ 


H.  Total  Hidden  Area  S 

We  first  compute  the  total  hidden  area  S  for  transmission  from  station  02  to  station  Oi,  as  illustrated  by  Fig.  16.  The 

distance  between  station  02  and  0\  is  d.  The  CS_Range  and  TX_Range  are  R  and  r  respectively.  In  Fig.  16,  the  grey  area 

Sabd  is  the  hidden  area  that  we  will  compute. 

From  cosine  theorem,  we  get  6  and  ip\ 

6  =  arccos[(r2  +  d2  —  R2)/(2rd)\  (178) 

if  =  arccos[(c?2  +  R2  —  r2)/(2dR)]  (179) 

Then  we  easily  get  the  following  areas 

SaOxB  =  r2(7r  —  9)/2 
SaO!02  =  dr  sin  9/2 
Sao2c  =  R2f/2 

Since  Sabd  =  ‘^{Sao1b  +  -S'aoi02  ~  Sao2c ),  we  have 

r2  (n  —  6)  +  dr  sin  8  —  R2ip  if  r  >  R  —  d 
0  otherwise 

After  the  normalization  procedure  described  later  in  this  section,  the  relationship  is  plotted  in  Fig.  4.  When  R 
is  reduced  to  the  one  established  in  [41]  and  [39]. 


(180) 

(181) 

(182) 


(183) 

=  r,  the  equation 


16  an  example  of  such  a  sequence  is  bj.  =  1  —  e  fc. 
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Fig.  16.  Illustration  of  hidden  area.  The  shaded  region  is  the  hidden  area. 


I.  Hidden  Area  and  Covered  Area  E(i,j) 

As  shown  in  Fig.  6,  and  E(i,j )  are  the  observed  hidden  and  covered  area  in  annulus  A (j )  by  a  source  station  in 

annulus  A{i),  respectively.  When  M  is  large,  the  distance  d  from  a  source  station  in  annulus  A(i)  to  the  AP  approximates  the 
distance  from  the  middle  of  annulus  A(i)  to  the  AP: 

d  =  r(i-l/2)/M  (184) 

In  Fig.  6,  the  hidden  area  H(i,j)  can  be  considered  as  the  difference  between  total  hidden  area  and  S(i,  j  —  1), 

where  S(i,j)  is  the  total  hidden  area  observed  by  a  node  in  annulus  A(i)  when  the  TX_Range  of  the  AP  is  rj/M  (i.e.,  the 
AP  covers  as  far  as  annulus  A{j)).11  In  (183),  by  replacing  TX_Range  r  with  rj/M  and  inserting  the  distance  d  from  (184), 
we  get 

f  i^l?'2(7r-0)  +j(i-  !)sin0]  -  R2ip 

S(*,j)=<  j  =  M  —  i +  1, . . .  ,M  (185) 

[  0  otherwise 

Then  we  get  the  desired  the  hidden  area 

H(iJ)  =  S{i,j)  -  S(i,j  -1)  (186) 

Now  let  us  discuss  the  covered  area  E(i,j).  Each  annulus  A(j)  is  composed  of  E(i,j)  and  So  E(i,j)  is: 

E{i,j)  =  A(j)  —  H(i,j)  (187) 

where  H(i,j)  is  expressed  in  (186)  and  the  area  A(j)  is 

A{j)  =  7rr2[j2  —  (j  —  1)2]/M2  (188) 

For  ease  of  mathematical  manipulation,  we  normalize  the  above  obtained  areas  with  respect  to  the  transmission  area  of  a 
node,  which  is  nr2. 


J.  Derivation  of  Eq.  107 

We  consider  a  station  located  in  annulus  A(i)  (Fig.  5)  of  a  saturated  network  with  N  nodes.  Define  n  G  (0 ,  to)  as  the 
backoff  stage.  The  maximum  backoff  window  size  at  stage  n  is  Wn  =  2 nW .  Let  k  be  the  stochastic  process  representing  the 
reading  of  the  backoff  counter  for  a  given  station.  Then  we  get  a  discrete  two-dimensional  Markov  chain  (n,  k )  (Fig.  17)  to 
model  the  802.11  DCF.  It  is  the  same  as  that  in  [29], 

As  in  [29],  the  collision  probability  Pc{i )  is  assumed  to  be  independent  of  the  state  n  of  the  station.  From  Fig.  17  we  obtain 
the  transition  probabilities  of  the  Markov  chain: 


P{n ,  k\n,  k  +  1}  =  1, 

P{0,  k\n,  0}  =  (1  -  Pc(i))/W0, 
P{n,  k\n  —  1,0}  =  Pc(i)/Wn, 
P{m,  k\m,  0}  =  Pc{i)/Wm, 


k  £  (0,  Wn  —  2), 
n  G  (0,  to); 
k  G  (0,  Wo  —  1), 
n  G  (0,  to); 
k  G  (0,  Wn  —  1), 
n  G  (1,  to); 
fcG(0,Wm-l). 


17This  is  only  for  the  convenience  of  mathematical  manipulation.  The  transmission  ranges  for  all  stations  (including  AP)  are  r,  as  stated  before. 
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(1-Pc(i))/W„ 


Pc(.)/Wm 


Fig.  17.  The  2D  Markov  chain  to  model  802.11  DCF. 


Denote  bn,k  as  the  probability  of  state  (n,k)  in  steady  state.  From  the  above  transition  probabilities,  we  get  the  following 
relations: 

[  bn, o  =  Pc(i)nb0,o,  n  G  (0,  to  -  1); 

J  h  -  h 

\  Om,  0  —  i_  pc(^)^0,0) 

[  bn,k  =  k  G  (0,  Wn  -  1). 

With  these  three  relations,  each  state  probability  bn,k  can  be  expressed  with  6q,o-  Substitute  bnj:  into  the  normalization  condition 


J]  £  bn<k  =  l  (189) 

n= 0  k—0 

we  get  the  value  of  bo  q: 

b  2[1  —  2Pe(i)][l  —  Pc(i)] 

°’°  [1  —  2Pc(i)]{W  +  1)  +  Pc(i)W{\  —  (2Pc(i))m]  ’ 

As  a  transmission  occurs  only  when  the  backoff  counter  is  zero,  the  transmission  probability  r(i)  is  the  summation  of  the 
probabilities  for  the  states  with  k  =  0: 

m 

T(j )  =  Y.  bn, 0  (191) 


which  leads  to  Eq.  107. 


VI.  List  Of  Symbols,  Abbreviations  And  Synonyms 


Notation 

Definition 

DRABP 

Downlink  Rechargeable  Adaptive  Back-Pressure  policy 

NRABP 

Network  Rechargeable  Adaptive  Back-Pressure  policy 

A_ 

equal  by  denition 

W) 

vector  of  channel  states  for  slot  t 

W) 

transmission  power  vector  for  slot  t  selected  by  the  network  control  policy 

Vs 

set  of  power  vectors  where  P(t)  must  belong  when  it  holds  S(t)  =  s 

K 

set  of  commodities 

Al(t) 

amount  of  exogenously  generated  bits  during  slot  t  intended  for  user  l 

An>  (t) 

amount  of  commodity  c  bits  exogenously  generated  at  node  n  during  slot  t 

W),  Ui(t) 

amount  of  bits  stored  at  time  t  in  the  transport/network  layer,  respectively,  queue  of  user  l 

UW) 

amount  of  commodity  c  bits  stored  at  time  t  in  the  transport/network  layer,  respectively,  queue  of  node  n 

En{t) 

battery  level  of  node  n  at  time  t  (in  a  downlink  scenario,  we  just  write  E(t)) 

Bn(t) 

amount  of  replenished  energy  during  slot  t  due  to  battery  recharge  at  node  n  (in  a  downlink  scenario,  we  just  write  B(t)) 

Rl,in  (f  ) 

number  of  bits  (intended  for  user  /)  admitted  into  the  network  layer  at  slot  t 

number  of  commodity  c  bits  admitted  into  the  network  layer  at  node  n  and  slot  t 

X 

time  average  of  quantity  x  (depending  on  context,  may  also  denote  expected  value) 

X 

deterministic  upper  bound  imposed  on  quantity  x 

im 

number  of  bits  actually  transmitted  on  link  /  during  slot  t 

nit) 

time  average  of  Ri(0)  ...  Ri(t  —  1) 

r (t) 

time  average  of  R„^(0)  . . .  P!n\t  —  1) 

E[S] 

expectation  of  quantity  x 

m 

amount  of  bits  stored  at  time  t  in  virtual  bit  queue  of  user  l  (downlink  scenario  only) 

Dn(t) 

amount  of  bits  stored  at  time  t  in  virtual  power  queue  of  node  n  (in  a  downlink  scenario,  we  just  write  D{t)) 

Y^(t) 

amount  of  commodity  c  bits  stored  at  time  t  in  virtual  bit  queue  of  node  n 

TX_Range 

Transmission  Range 

CS_Range 

Carrier  Sensing  Range 

IF_Range 

Interference  Range 

a 

duration  of  a  successful  transmission 

(3 

duration  of  a  collision  period 

s 

duration  of  a  backoff  time  slot 

w 

initial  backoff  eindow  size 

Pc 

collision  probability 

T 

transmission  probability 

A(i) 

annulus  i 

tRTS 

RTS  transmission  time 

Ps(i) 

conditional  success  probability  given  that  a  node  in  annulus  A(i)  initiates  a  transmission 

TW) 

conditional  collision  probability  given  that  a  node  in  annulus  A{i)  initiates  a  transmission 

Pi  die 

probability  that  all  stations  are  idle  at  a  time  slot 

expected  number  of  stations  located  at  annulus  A  (i) 

Psuccess 

probability  that  is  a  successful  transmission  in  a  time  slot 

P collision 

probability  that  is  a  collision  in  time  slot 

s 

expected  aggregate  throughput  for  all  network 

T 

mean  duration  of  a  time  slot 

m 

per  node  throughput  in  annulus  A(i) 
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